Abstract. We propose a homogenized filter for multiscale signals, which allows us to reduce the
dimension of the system. We prove that the nonlinear filter converges to our homogenized filter.
This is achieved by a suitable asymptotic expansion of the dual of the Zakai equation, and by
probabilistically representing the correction terms with the help of BDSDEs.

1. Introduction

Filtering theory is an established field in applied probability and decision and control systems,
which is important in many practical applications from inertial guidance of aircraft and
spacecraft to weather and climate prediction. It provides a recursive algorithm for estimating
a signal or state of a random dynamical system based on noisy measurements. More precisely,
filtering problems consist of a signal process $X = \{X_t : t \ge 0\}$ which is unobservable and
an observation process $Y = \{Y_t : t \ge 0\}$ which is a function of $X$ corrupted by noise. The
main objective of filtering theory is to get the best estimate of $X_t$ based on the information
$\mathcal{Y}_t = \sigma\{Y_s : 0 \le s \le t\}$. This is given by the conditional distribution $\pi_t$ of $X_t$ given $\mathcal{Y}_t$ or
equivalently, the conditional expectations $E[f(X_t) \mid \mathcal{Y}_t]$ for a rich enough class of functions $f$. Since
this estimate minimizes the mean square error loss, we call $\pi_t$ the optimal filter. The goal of
filtering theory is to characterize this conditional distribution effectively. In simplified problems
where the signal and the observation models are linear and Gaussian, the filtering equation is
finite-dimensional, and the solution is the well-known Kalman-Bucy filter. In more realistic
problems, nonlinearities in the models lead to more complicated equations for $\pi_t$, introduced by
Zakai (1969) and Fujisaki et al. (1972), which describe the evolution of the conditional distribution
in the space of probability measures (see, for example, Bain and Crisan (2009), Kallianpur
(1980), Liptser and Shiryaev (2001)).

It is impractical to implement a numerical solution to such infinite dimensional stochastic 
evolution equations of the general nonlinear filtering problem by finite difference or finite element 
approximations. Therefore, extended Kalman filter algorithms, which use linear approximations 
to the signal dynamics and observation, have been used extensively in several applications.
These provide essentially a first order approximation to an infinite dimensional problem and
can perform quite poorly in problems with strong nonlinearities. Particle filters have been well
established for the implementation of nonlinear filtering in science and engineering applications.
Doucet et al. (2001) and Arulampalam et al. (2002) provide comprehensive insight into particle
filtering. However, due to dimensionality issues (see, for example, Snyder et al. (2008)) and
computational complexities that arise in representing the signal density using a high number
of particles, the problem of particle filtering in high dimensions is still not completely resolved.
As a result of these difficulties, we have established a novel particle filtering method (Park et al.,
2011) for multiscale signal and observation processes that combines homogenization with
filtering techniques. The theoretical basis for this new capability is presented in this paper.

The results presented here are set within the context of slow-fast dynamical systems, where 
the rates of change of different variables differ by orders of magnitude. Multiple time scales 
occur in models throughout science and engineering. For example, climate evolution is
governed by fast atmospheric and slow oceanic dynamics, and state dynamics in electric power
systems consist of fast- and slowly-varying elements. This paper addresses the effects of the
multiscale signal and observation processes via the study of the Zakai equation. We construct 
a lower dimensional Zakai equation in a canonical way. This problem has also been studied 
in Park et al. (2010) using a different approach from what is presented here. In moderate 
dimensional problems, particle filters are an attractive alternative to numerical approximation 
of the stochastic partial differential equations (SPDEs) by finite difference or finite element 
methods. For the reduced nonlinear model an appropriate form of particle filter can be a viable 
and useful scheme. Hence, Namachchivaya et al. (2012) presents the numerical solution of the 
lower dimensional stochastic partial differential equation derived here, as it is applied to several 
higher dimensional multiscale applications. 

In general, this paper provides rigorous mathematical results that support the numerical
algorithms based on the idea that stochastically averaged models provide qualitatively useful
results, which are potentially helpful in developing inexpensive lower-dimensional filtering.
This was demonstrated by Park et al. (2011) in the context of homogenized particle filters and
by Harlim and Kang (2011) in the context of averaged ensemble Kalman filters. The convergence
of the optimal filter to the homogenized filter is shown using backward stochastic differential
equations (BSDEs) and asymptotic techniques.

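Let us describe the main result. We assume the signal is given as the solution of the two time
scale stochastic differential equation (SDE)

$$dX^\varepsilon_t = b(X^\varepsilon_t, Z^\varepsilon_t)\,dt + \sigma(X^\varepsilon_t, Z^\varepsilon_t)\,dV_t$$
$$dZ^\varepsilon_t = \frac{1}{\varepsilon} f(X^\varepsilon_t, Z^\varepsilon_t)\,dt + \frac{1}{\sqrt{\varepsilon}}\, g(X^\varepsilon_t, Z^\varepsilon_t)\,dW_t.$$

Here $X^\varepsilon$ is the slow component and $Z^\varepsilon$ is the fast component. We assume that for every fixed
$x$, the solution $Z^x$ of

$$dZ^x_t = f(x, Z^x_t)\,dt + g(x, Z^x_t)\,dW_t$$

is ergodic and converges rapidly to its unique stationary distribution. In this case it is well
known that $X^\varepsilon$ converges in distribution to a diffusion $X^0$ which is governed by an SDE

$$dX^0_t = \bar b(X^0_t)\,dt + \bar\sigma(X^0_t)\,dV_t.$$

This $X^0$ is used to construct an averaged filter $\pi^0$. We denote the optimal filter for the full
system by $\pi^\varepsilon$. Define the $x$-marginal of $\pi^\varepsilon$ as $\pi^{\varepsilon,x}$, i.e.

$$\pi^{\varepsilon,x}_t(\varphi) = \int \varphi(x)\, \pi^\varepsilon_t(dx, dz).$$

Our main result is then

Theorem. Under the assumptions stated in Theorem 3.1, for every $p \ge 1$ and $T \ge 0$ there
exists $C > 0$, such that for every $\varphi \in C_b^4(\mathbb{R}^m, \mathbb{R})$

$$\left( E_Q\left[ \left| \pi^{\varepsilon,x}_T(\varphi) - \pi^0_T(\varphi) \right|^p \right] \right)^{1/p} \le \sqrt{\varepsilon}\, C\, \|\varphi\|_{4,\infty}.$$

In particular, there exists a metric $d$ on the space of probability measures, such that $d$ generates
the topology of weak convergence, and such that for every $T \ge 0$ there exists $C > 0$ such that

$$E_Q\left[ d(\pi^{\varepsilon,x}_T, \pi^0_T) \right] \le \sqrt{\varepsilon}\, C.$$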

We begin in Section 2 by presenting the general formulation of the multiscale nonlinear filtering
problem. Here we describe the measure-valued Zakai equation and introduce the homogenized
equations that we seek to derive for the reduced dimension unnormalized filter. Section 3 presents
the formal asymptotic expansion of the multiscale Zakai equation that results in several SPDEs.
We also present the main results of this paper in this section. Section 4 provides the probabilistic
representation of the SPDEs, that is, we describe the solutions of the infinite dimensional SPDEs
by finite dimensional backward doubly stochastic differential equations (BDSDEs). In that
section we also restate some of the related results of Rozovskii (1990) and Pardoux and Peng (1994).
In Section 5 we present some of the preliminary results of Pardoux and Veretennikov
(2003) on convergence of the transition function of the fast diffusion $Z^x$. These estimates are used in
the proof of the main results presented in Section 6.



2. Formulation of multiscale nonlinear filtering problems 

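Let $(\Omega, \mathcal{F}, (\mathcal{F}_t), Q)$ be a filtered probability space that supports a $(k + l + d)$-dimensional
standard Brownian motion $(V, W, B)$. Consider the signal $(X^\varepsilon, Z^\varepsilon)$ to be a two time scale
diffusion process with a fast component $Z^\varepsilon$ and a slow component $X^\varepsilon$:

$$dX^\varepsilon_t = b(X^\varepsilon_t, Z^\varepsilon_t)\,dt + \sigma(X^\varepsilon_t, Z^\varepsilon_t)\,dV_t \tag{1}$$
$$dZ^\varepsilon_t = \frac{1}{\varepsilon} f(X^\varepsilon_t, Z^\varepsilon_t)\,dt + \frac{1}{\sqrt{\varepsilon}}\, g(X^\varepsilon_t, Z^\varepsilon_t)\,dW_t,$$

where $X^\varepsilon_t \in \mathbb{R}^m$, $Z^\varepsilon_t \in \mathbb{R}^n$, $W_t \in \mathbb{R}^k$ and $V_t \in \mathbb{R}^l$ are independent standard Brownian motions,
$b : \mathbb{R}^{m+n} \to \mathbb{R}^m$, $\sigma : \mathbb{R}^{m+n} \to \mathbb{R}^{m \times l}$, $f : \mathbb{R}^{m+n} \to \mathbb{R}^n$, $g : \mathbb{R}^{m+n} \to \mathbb{R}^{n \times k}$. All the functions above
are assumed to be Borel-measurable. For fixed $x \in \mathbb{R}^m$, define

$$dZ^x_t = f(x, Z^x_t)\,dt + g(x, Z^x_t)\,dW_t. \tag{2}$$

Assume that for all $x \in \mathbb{R}^m$, $Z^x$ is ergodic and converges rapidly towards its stationary measure
$\mu(x, \cdot)$. We will make this precise later.

The $d$-dimensional observation is given by

$$Y^\varepsilon_t = \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)\,ds + B_t$$

with Borel-measurable $h : \mathbb{R}^{m+n} \to \mathbb{R}^d$. $B$ is assumed to be a $d$-dimensional standard Brownian
motion that is independent of $W$ and $V$.

Define $\mathcal{Y}^\varepsilon_t = \sigma(Y^\varepsilon_s : 0 \le s \le t) \vee \mathcal{N}$, where $\mathcal{N}$ are the $Q$-negligible sets. For a finite measure $\pi$
on $\mathbb{R}^{m+n}$ and for a bounded measurable function $\varphi$ on $\mathbb{R}^{m+n}$ denote $\pi(\varphi) = \int \varphi(x, z)\, \pi(dx, dz)$.

Then our aim is to calculate the measure-valued process $(\pi^\varepsilon_t, t \ge 0)$ determined by

$$\pi^\varepsilon_t(\varphi) = E_Q\left[ \varphi(X^\varepsilon_t, Z^\varepsilon_t) \mid \mathcal{Y}^\varepsilon_t \right].$$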

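Define the Girsanov transform

$$\frac{dP^\varepsilon}{dQ}\bigg|_{\mathcal{F}_t} = D^\varepsilon_t = \exp\left( -\int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\, dB_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\, ds \right).$$

Under $P^\varepsilon$, the observation process $Y^\varepsilon$ is a Brownian motion and independent of $(X^\varepsilon, Z^\varepsilon)$. By
the Kallianpur-Striebel formula,

$$\pi^\varepsilon_t(\varphi) = E_Q\left[ \varphi(X^\varepsilon_t, Z^\varepsilon_t) \mid \mathcal{Y}^\varepsilon_t \right] = \frac{E_{P^\varepsilon}\left[ \varphi(X^\varepsilon_t, Z^\varepsilon_t)\, \tilde D^\varepsilon_t \mid \mathcal{Y}^\varepsilon_t \right]}{E_{P^\varepsilon}\left[ \tilde D^\varepsilon_t \mid \mathcal{Y}^\varepsilon_t \right]}$$

with

$$\tilde D^\varepsilon_t = \frac{dQ}{dP^\varepsilon}\bigg|_{\mathcal{F}_t} = (D^\varepsilon_t)^{-1} = \exp\left( \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\, dY^\varepsilon_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\, ds \right).$$

So if we define

$$\rho^\varepsilon_t(\varphi) = E_{P^\varepsilon}\left[ \varphi(X^\varepsilon_t, Z^\varepsilon_t) \exp\left( \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\, dY^\varepsilon_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\, ds \right) \,\Big|\, \mathcal{Y}^\varepsilon_t \right],$$

then

$$\pi^\varepsilon_t(\varphi) = \frac{\rho^\varepsilon_t(\varphi)}{\rho^\varepsilon_t(1)}.$$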

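Denote by $\mathcal{L}^\varepsilon = \frac{1}{\varepsilon} \mathcal{L}_F + \mathcal{L}_S$ the differential operator associated to $(X^\varepsilon, Z^\varepsilon)$. That is,

$$\mathcal{L}_F = \sum_{i=1}^n f_i(x, z) \frac{\partial}{\partial z_i} + \frac{1}{2} \sum_{i,j=1}^n (g g^*)_{ij}(x, z) \frac{\partial^2}{\partial z_i \partial z_j},$$

$$\mathcal{L}_S = \sum_{i=1}^m b_i(x, z) \frac{\partial}{\partial x_i} + \frac{1}{2} \sum_{i,j=1}^m (\sigma \sigma^*)_{ij}(x, z) \frac{\partial^2}{\partial x_i \partial x_j},$$

where $\cdot^*$ denotes the transpose of a matrix or a vector.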

Then the unnormalized measure-valued process $\rho^\varepsilon$ satisfies the Zakai equation:

$$d\rho^\varepsilon_t(\varphi) = \rho^\varepsilon_t(\mathcal{L}^\varepsilon \varphi)\,dt + \rho^\varepsilon_t(h \varphi)^*\, dY^\varepsilon_t \tag{3}$$

for every $\varphi \in C_b^2(\mathbb{R}^{m+n}, \mathbb{R})$ (see, for example, Bain and Crisan (2009)). For $k \ge 0$, $C_b^k$ is the
space of $k$ times continuously differentiable functions $f$, such that $f$ and all its partial derivatives
up to order $k$ are bounded.

From the theory of stochastic averaging (see, for example, Papanicolaou et al. (1977)), we
know that under suitable conditions, $X^\varepsilon$ converges in law to $X^0$ as $\varepsilon \to 0$, where $X^0$ is the
solution of an SDE

$$dX^0_t = \bar b(X^0_t)\,dt + \bar\sigma(X^0_t)\,dW_t$$

for suitably averaged $\bar b$ and $\bar\sigma$. Denote the generator of $X^0$ by $\bar{\mathcal{L}}$.

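We want to show that as long as we are only interested in estimating the slow component, we
can take advantage of this fact. More precisely, we want to find a homogenized (unnormalized)
filter $\rho^0$ such that for small $\varepsilon$, $\rho^{\varepsilon,x}$, which is the $x$-marginal of $\rho^\varepsilon$, is close to $\rho^0$. The $x$-marginal
of $\rho^\varepsilon$ is defined as

$$\rho^{\varepsilon,x}_t(\varphi) = \int \varphi(x)\, \rho^\varepsilon_t(dx, dz)$$

for every measurable bounded $\varphi : \mathbb{R}^m \to \mathbb{R}$, and $\rho^0$ is the solution of

$$d\rho^0_t(\varphi) = \rho^0_t(\bar{\mathcal{L}} \varphi)\,dt + \rho^0_t(\bar h \varphi)^*\, dY^\varepsilon_t \tag{4}$$
$$\rho^0_0(\varphi) = E_Q[\varphi(X^0_0)],$$

where $\bar h$ is a suitably averaged version of $h$. The measure-valued processes $\pi^0$ and $\pi^{\varepsilon,x}$ are then
defined in terms of $\rho^0$ and $\rho^{\varepsilon,x}$ as $\pi^\varepsilon$ was defined in terms of $\rho^\varepsilon$:

$$\pi^0_t(\varphi) = \frac{\rho^0_t(\varphi)}{\rho^0_t(1)} \quad \text{and} \quad \pi^{\varepsilon,x}_t(\varphi) = \frac{\rho^{\varepsilon,x}_t(\varphi)}{\rho^{\varepsilon,x}_t(1)}.$$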

Note that the homogenized filter is still driven by the real observation $Y^\varepsilon$ and not by a
"homogenized observation". This is practical for implementing the homogenized filter in
applications, since such a homogenized observation is usually not available. Moreover, even if a
homogenized observation were available, using it would lead to a loss of information for estimating
the signal compared to using the actual observation.

In this paper, we will prove $L^1$-convergence of the actual filter to the homogenized filter, i.e.
we will show that for any $T \ge 0$,

$$\lim_{\varepsilon \to 0} E_Q\left[ d(\pi^{\varepsilon,x}_T, \pi^0_T) \right] = 0,$$

where $d$ denotes a suitable distance on the space of probability measures that generates the
topology of weak convergence. This convergence result is shown in Park et al. (2010) for a
two-dimensional multiscale signal process with no drift in the fast component SDE. Here, we
extend the result to an $\mathbb{R}^{m+n}$-valued signal process with drift and diffusion coefficients of
the fast and slow components dependent on both components. The proof of Park et al. (2010) is
based on representing the slow component as a time-changed Brownian motion under a suitable
measure, which cannot be extended easily to the multidimensional setting we assume here.

Based on (3) and (4), the filter convergence problem is a problem of homogenization of an
SPDE. Homogenization of diffusion processes with periodic structures is done using the
martingale problem approach in Papanicolaou et al. (1977), and limit behavior of stochastic
processes is studied using asymptotic analysis in Papanicolaou and Kohler (1975) and Chapter 2
of Bensoussan et al. (1978). Bensoussan et al. (1978) study linear SPDEs with periodic
coefficients and also use a probabilistic approach in Chapter 3. Homogenization for the nonlinear
filtering problem is studied in Bensoussan and Blankenship (1986) and Ichihara (2004), where
asymptotic analysis is performed on a dual representation of the nonlinear filtering equation.
Ichihara (2004) studies homogenization for Zakai-type SPDEs using two different approaches:
the martingale problem approach and BSDE techniques. Our convergence proof applies
BSDE techniques by invoking the dual representation of the filtering equation and using
asymptotic analysis to determine the limit behavior of the solution of the backward equation.
Pardoux and Veretennikov (2003) give precise estimates for the transition function of an
ergodic SDE of the type (2), and these results are used in our proof. To our knowledge, this
method of homogenization for SPDEs using BSDEs and asymptotic methods has not been used
before.

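For a given bounded test function $\varphi$ and terminal time $T$, we follow Pardoux (1979) in
introducing the associated dual process $v^{\varepsilon,T,\varphi}_t(x, z)$, which is a dynamic version of $E_{P^\varepsilon}[\varphi(X^\varepsilon_T)\, \tilde D^\varepsilon_T \mid \mathcal{Y}^\varepsilon_T]$:

$$v^{\varepsilon,T,\varphi}_t(x, z) = E_{P^\varepsilon_{t,x,z}}\left[ \varphi(X^\varepsilon_T)\, \tilde D^\varepsilon_{t,T} \mid \mathcal{Y}^\varepsilon_{t,T} \right]$$

where $P^\varepsilon_{t,x,z}$ is the measure under which $X^\varepsilon$ and $Z^\varepsilon$ are governed by the same dynamics as
under $P^\varepsilon$, but $(X^\varepsilon, Z^\varepsilon)$ stays in $(x, z)$ until time $t$, then it starts to follow the SDE dynamics.
Here $\tilde D^\varepsilon_{t,T} = \tilde D^\varepsilon_T (\tilde D^\varepsilon_t)^{-1}$, and $\mathcal{Y}^\varepsilon_{t,T} = \sigma(Y^\varepsilon_r - Y^\varepsilon_t : t \le r \le T) \vee \mathcal{N}$ (recall that $\mathcal{N}$ denotes the
$Q$-negligible sets). From the Markov property of $(X^\varepsilon, Z^\varepsilon)$ it follows that for any $t \in [0, T]$:
$\rho^\varepsilon_t(v^{\varepsilon,T,\varphi}_t) = \rho^{\varepsilon,x}_T(\varphi)$. In particular (because at time 0, $\rho^\varepsilon$ is just the starting distribution of
$(X^\varepsilon, Z^\varepsilon)$):

$$\rho^{\varepsilon,x}_T(\varphi) = \int v^{\varepsilon,T,\varphi}_0(x, z)\, Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx, dz).$$

Similarly introduce

$$v^{0,T,\varphi}_t(x) = E_{P^0_{t,x}}\left[ \varphi(X^0_T)\, \tilde D^0_{t,T} \mid \mathcal{Y}^\varepsilon_{t,T} \right]$$

where

$$\tilde D^0_{t,T} = \exp\left( \int_t^T \bar h(X^0_s)^*\, dY^\varepsilon_s - \frac{1}{2} \int_t^T |\bar h(X^0_s)|^2\, ds \right)$$

and $P^0_{t,x}$ is the measure under which $X^0$ is governed by the same dynamics as under $P^\varepsilon$, but
stays in $x$ until time $t$. We can also show that for any $t \in [0, T]$: $\rho^0_t(v^{0,T,\varphi}_t) = \rho^0_T(\varphi)$, so that

$$\rho^0_T(\varphi) = \int v^{0,T,\varphi}_0(x)\, Q_{X^0_0}(dx).$$

Note that $Q_{X^0_0} = Q_{X^\varepsilon_0}$, because the homogenized process has the same starting distribution as
the unhomogenized one.

Now fix $T$ and $\varphi \in C_b^4(\mathbb{R}^m, \mathbb{R})$ and write $v^\varepsilon = v^{\varepsilon,T,\varphi}$ and $v^0 = v^{0,T,\varphi}$.

Our aim is to show that for nice test functions $\varphi$, and for the dual processes $v^\varepsilon$ and $v^0$ defined
above, $E[|v^\varepsilon_0(x, z) - v^0_0(x)|^p]$ is small (in a way that will depend on $x$ and $z$). Then, by Jensen's
inequality and Fubini's theorem,

$$E\left[ |\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)|^p \right] = E\left[ \left| \int \left( v^\varepsilon_0(x, z) - v^0_0(x) \right) Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx, dz) \right|^p \right]$$
$$\le E\left[ \int |v^\varepsilon_0(x, z) - v^0_0(x)|^p\, Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx, dz) \right]$$
$$= \int E\left[ |v^\varepsilon_0(x, z) - v^0_0(x)|^p \right] Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx, dz)$$

will also be small as long as $Q_{(X^\varepsilon_0, Z^\varepsilon_0)}$ is well behaved.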

3. Formal expansions of the filtering equations and the main results 

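Before we continue, let us change notation: For large parts of this article we will only work
under $P^\varepsilon$, and the process $Y^\varepsilon$ is a Brownian motion under $P^\varepsilon$ which is independent of $(X^\varepsilon, Z^\varepsilon, X^0)$.
Therefore from now on we write $P$ instead of $P^\varepsilon$ and $B$ instead of $Y^\varepsilon$ to facilitate the reading.
The distribution and notation for the Markov processes $(X^\varepsilon, Z^\varepsilon, X^0)$ do not change.

The key point is now that $v^\varepsilon$ and $v^0$ solve backward SPDEs:

$$-dv^\varepsilon_t(x, z) = \mathcal{L}^\varepsilon v^\varepsilon_t(x, z)\,dt + h(x, z)^* v^\varepsilon_t(x, z)\, d\overleftarrow{B}_t \tag{5}$$
$$v^\varepsilon_T(x, z) = \varphi(x)$$

and

$$-dv^0_t(x) = \bar{\mathcal{L}} v^0_t(x)\,dt + \bar h(x)^* v^0_t(x)\, d\overleftarrow{B}_t \tag{6}$$
$$v^0_T(x) = \varphi(x).$$

Here and everywhere in this article, $d\overleftarrow{B}$ denotes Itô's backward integral.
We formally expand $v^\varepsilon$ as

$$v^\varepsilon_t(x, z) = u^0_{t/\varepsilon}(x, z) + \varepsilon u^1_{t/\varepsilon}(x, z) + \varepsilon^2 u^2_{t/\varepsilon}(x, z).$$

Note that rigorously this does not make any sense, because:

• We work with equations with terminal conditions. But when we send $\varepsilon \to 0$, then $t/\varepsilon$
converges to infinity. So for which time should the terminal condition of e.g. $u^0$ be
defined?

• The terms in this expansion will all be stochastic. Then if $u^1_t$ is adapted to $\mathcal{F}^B_{t,T}$, the
stochastic integral $\int u^1_{s/\varepsilon}(x, z)\, d\overleftarrow{B}_s$ a priori does not make any sense for $\varepsilon < 1$.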
However if we do such a formal asymptotic expansion, and then call

$$v^0_t(x) = u^0_{t/\varepsilon}(x), \quad \psi^1_t(x, z) = \varepsilon u^1_{t/\varepsilon}(x, z), \quad R_t(x, z) = \varepsilon^2 u^2_{t/\varepsilon}(x, z)$$

(of course all terms except $v^0$ depend on $\varepsilon$, which we omit in the notation to facilitate the
reading), then these terms have to solve the following equations:

$$-dv^0_t(x) = \bar{\mathcal{L}} v^0_t(x)\,dt + \bar h(x)^* v^0_t(x)\, d\overleftarrow{B}_t$$
$$-d\psi^1_t(x, z) = \frac{1}{\varepsilon} \mathcal{L}_F \psi^1_t(x, z)\,dt + (\mathcal{L}_S - \bar{\mathcal{L}}) v^0_t(x)\,dt + (h(x, z) - \bar h(x))^* v^0_t(x)\, d\overleftarrow{B}_t \tag{7}$$
$$-dR_t(x, z) = \mathcal{L}^\varepsilon R_t(x, z)\,dt + \mathcal{L}_S \psi^1_t(x, z)\,dt + h(x, z)^* \left( \psi^1_t(x, z) + R_t(x, z) \right) d\overleftarrow{B}_t \tag{8}$$

with terminal conditions

$$v^0_T(x) = \varphi(x), \quad \psi^1_T(x, z) = R_T(x, z) = 0.$$

Note that the equation for $v^0$ is exactly the desired equation (6). By existence and uniqueness
of the solutions to these linear equations, we can apply superposition to obtain that then indeed

$$v^\varepsilon_t(x, z) = v^0_t(x) + \psi^1_t(x, z) + R_t(x, z).$$

Therefore showing $L^p$-convergence of $v^\varepsilon$ to $v^0$ reduces to showing $L^p$-convergence of $\psi^1 + R$ to
0. To achieve this, we will give probabilistic representations of $\psi^1$ and $R$ in terms of backward
doubly stochastic differential equations. This will allow us to apply the existing estimates for
the transition function of $Z^x$ from Pardoux and Veretennikov (2003).
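
The superposition claim can be checked by adding the three equations term by term. The following is only a sketch of that bookkeeping, under the assumption (implicit in the expansion) that $v^0$ does not depend on $z$, so that $\mathcal{L}_F v^0 = 0$; here $\bar{\mathcal{L}}$ and $\bar h$ denote the averaged generator and the averaged observation function.

```latex
% dt-terms of the equations for v^0, \psi^1 and R, added up:
\begin{align*}
\bar{\mathcal L} v^0
 + \tfrac{1}{\varepsilon}\mathcal L_F \psi^1
 + (\mathcal L_S - \bar{\mathcal L}) v^0
 + \mathcal L^\varepsilon R
 + \mathcal L_S \psi^1
 &= \mathcal L_S v^0 + \mathcal L^\varepsilon \psi^1 + \mathcal L^\varepsilon R \\
 &= \mathcal L^\varepsilon \bigl(v^0 + \psi^1 + R\bigr),
\end{align*}
% using \mathcal L^\varepsilon = \varepsilon^{-1}\mathcal L_F + \mathcal L_S
% and \mathcal L_F v^0 = 0.
%
% dB-terms, added up:
\begin{align*}
\bar h^{*} v^0 + (h - \bar h)^{*} v^0 + h^{*}\bigl(\psi^1 + R\bigr)
 = h^{*}\bigl(v^0 + \psi^1 + R\bigr).
\end{align*}
% Terminal values: \varphi + 0 + 0 = \varphi.
```

So the sum $v^0 + \psi^1 + R$ has the same drift, the same stochastic integrand and the same terminal condition as the backward equation (5), and uniqueness for that linear equation identifies it with the dual process of the full system.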

It will be convenient for us to work with functions that are smoother in their $x$-component
than they are in their $z$-component or vice versa. To do so, introduce the function spaces
$C^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^d)$. For $\theta : \mathbb{R}^m \times \mathbb{R}^n \to \mathbb{R}^d$, $\theta = \theta(x, z)$, write $\theta \in C^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^d)$ if $\theta$ is
$k$ times continuously differentiable in its $x$-components and $l$ times continuously differentiable
in its $z$-components. If $\theta$ as well as its partial derivatives up to order $(k, l)$ are bounded, write
$\theta \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^d)$.

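Introduce the following assumptions:

(Hstat) For the existence of a stationary distribution $\mu(x, dz)$ for $Z^x$, we suppose that there exist
$M_0 > 0$, $C > 0$, $\alpha > 0$, such that for all $|z| \ge M_0$

$$\sup_x \langle f(x, z), z \rangle \le -C |z|^\alpha.$$

For the uniqueness of the stationary distribution $\mu(x, dz)$ of $Z^x$, we suppose uniform
ellipticity, i.e. that there are $0 < \lambda \le \Lambda < \infty$, such that

$$\lambda I \le g g^*(x, z) \le \Lambda I$$

in the sense of positive semi-definite matrices ($I$ is the unit matrix).

$(HF_{k,l})$ The coefficients of the fast diffusion satisfy $f \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^n)$ and $g \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^{n \times k})$.

$(HS_{k,l})$ The coefficients of the slow diffusion satisfy $b \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^m)$ and $\sigma \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^{m \times l})$.

$(HO_{k,l})$ The observation function $h$ satisfies $h \in C_b^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}^d)$.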
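
To see that the recurrence and ellipticity conditions can hold together with bounded coefficients, here is a hypothetical example. It is not taken from the paper: the fast drift and diffusion below, and the choice of an $n$-dimensional driving noise, are assumptions made only for illustration.

```latex
% Illustrative fast dynamics (an assumption of this example, independent of x):
\[
  f(x,z) = -\frac{z}{\sqrt{1+|z|^{2}}}, \qquad g(x,z) = \sqrt{2}\, I_{n}.
\]
% f is smooth and bounded, with bounded derivatives of every order.
%
% Recurrence condition with M_0 = 1, C = 1/\sqrt{2}, \alpha = 1:
\[
  \langle f(x,z), z\rangle = -\frac{|z|^{2}}{\sqrt{1+|z|^{2}}}
  \;\le\; -\frac{|z|}{\sqrt{2}}
  \qquad \text{for } |z| \ge 1,
\]
% because \sqrt{1+|z|^2} \le \sqrt{2}\,|z| whenever |z| \ge 1.
%
% Uniform ellipticity: g g^{*} = 2 I_n, so one can take \lambda = \Lambda = 2.
%
% Stationary distribution: since \tfrac12 g g^{*} = I_n, a density p with
% \nabla p = f\,p is invariant, which gives
\[
  \mu(x,dz) = c_n^{-1}\, e^{-\sqrt{1+|z|^{2}}}\,dz,
  \qquad c_n = \int_{\mathbb{R}^{n}} e^{-\sqrt{1+|z|^{2}}}\,dz .
\]
```

Note that a bounded drift forces $\alpha \le 1$ in the recurrence condition, which is why a linear (Ornstein-Uhlenbeck type) drift $f(x, z) = -z$ would satisfy the recurrence condition but not the boundedness required of the fast coefficients.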

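We will usually write $p_\infty(x, dz)$ instead of $\mu(x, dz)$. Also introduce the notation

$$p_t(z, \theta; x) := \int \theta(x, z')\, p_t(z, z'; x)\, dz' = E\left[ \theta(x, Z^x_t) \mid Z^x_0 = z \right]$$

where $z$ denotes the starting point of $Z^x$, and $z' \mapsto p_t(z, z'; x)$ is the density of $Z^x_t$ if at time
0 it is started in $z$. Note that the density exists for all $t > 0$ under the condition (Hstat), because
of the uniform ellipticity of $g g^*$. Similarly

$$p_\infty(\theta; x) = \int \theta(x, z)\, p_\infty(x, dz).$$

Let the differential operator $\bar{\mathcal{L}}$ be defined as

$$\bar{\mathcal{L}} = \sum_{i=1}^m \bar b_i(x) \frac{\partial}{\partial x_i} + \frac{1}{2} \sum_{i,j=1}^m \bar a_{ij}(x) \frac{\partial^2}{\partial x_i \partial x_j}$$

where $\bar b(x) = p_\infty(b; x)$ and $\bar a(x) = p_\infty(\sigma \sigma^*; x)$. Also define $\bar h(x) = p_\infty(h; x)$.

We introduce the following notation: A multiindex $\alpha = (\alpha_1, \dots, \alpha_m) \in \mathbb{N}_0^m$ is of order

$$|\alpha| = \alpha_1 + \dots + \alpha_m.$$

Given such a multiindex, define the differential operator

$$D^\alpha = \frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1} \cdots \partial x_m^{\alpha_m}}.$$

Finally introduce the following norms for $f \in C_b^k(\mathbb{R}^m, \mathbb{R}^n)$:

$$\|f\|_{k,\infty} = \sum_{|\alpha| \le k} \|D^\alpha f\|_\infty$$

where $\|\cdot\|_\infty$ is the usual supremum norm.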
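Our main result is

Theorem 3.1. Assume (Hstat), $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, and that the initial distribution
$Q_{(X^\varepsilon_0, Z^\varepsilon_0)}$ has finite moments of every order. Then for every $p \ge 1$ and $T \ge 0$ there exists
$C > 0$, such that for every $\varphi \in C_b^4(\mathbb{R}^m, \mathbb{R})$

$$\left( E_Q\left[ \left| \pi^{\varepsilon,x}_T(\varphi) - \pi^0_T(\varphi) \right|^p \right] \right)^{1/p} \le \sqrt{\varepsilon}\, C\, \|\varphi\|_{4,\infty}.$$

In particular, there exists a metric $d$ on the space of probability measures, such that $d$ generates
the topology of weak convergence, and such that for every $T \ge 0$ there exists $C > 0$, such that

$$E_Q\left[ d(\pi^{\varepsilon,x}_T, \pi^0_T) \right] \le \sqrt{\varepsilon}\, C.$$

This result will be proven in Section 6.

In particular we can use Borel-Cantelli to conclude that if $(\varepsilon_n)$ converges quickly enough to
0, then $\pi^{\varepsilon_n,x}_T$ will a.s. converge weakly to $\pi^0_T$.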
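
One way to spell out the Borel-Cantelli step is sketched below. It assumes the rate $\sqrt{\varepsilon}\, C$ from the theorem and a sequence with $\sum_n \sqrt{\varepsilon_n} < \infty$; the concrete choice $\varepsilon_n = n^{-4}$ is only an illustration and is not taken from the paper.

```latex
% Markov's inequality applied to the bound E_Q[d(\cdot,\cdot)] \le \sqrt{\varepsilon}\,C:
\[
  Q\Bigl( d\bigl(\pi^{\varepsilon_n,x}_T, \pi^{0}_T\bigr) > \delta \Bigr)
  \;\le\; \frac{C \sqrt{\varepsilon_n}}{\delta}
  \qquad \text{for every } \delta > 0 .
\]
% With, for instance, \varepsilon_n = n^{-4} the right-hand side is summable:
\[
  \sum_{n \ge 1} \frac{C\sqrt{\varepsilon_n}}{\delta}
  = \frac{C}{\delta} \sum_{n \ge 1} n^{-2} < \infty .
\]
% Borel-Cantelli: for each fixed \delta, almost surely only finitely many n satisfy
% d(\pi^{\varepsilon_n,x}_T, \pi^0_T) > \delta. Taking \delta = 1/j, j = 1, 2, \dots,
% and intersecting the corresponding almost sure events gives
\[
  d\bigl(\pi^{\varepsilon_n,x}_T, \pi^{0}_T\bigr) \longrightarrow 0
  \qquad Q\text{-a.s. as } n \to \infty .
\]
```

Since $d$ generates the topology of weak convergence, this is almost sure weak convergence along the chosen sequence.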

The ideas are rather simple: We represent the backward SPDEs by finite-dimensional
stochastic equations (these will be BDSDEs). The diffusion operators get replaced by the associated
diffusions. We are able to solve those finite-dimensional equations explicitly, or at least give
explicit estimates up to an application of Gronwall's lemma. This allows us to estimate $\psi^1$ and $R$ in
terms of the transition function of the fast diffusion. But Pardoux and Veretennikov (2003)
proved very precise estimates for this transition function. These estimates allow us to obtain
the convergence.

While the ideas are simple, the precise formulation and the actual proofs are quite technical. 
We start by describing the probabilistic representation. 

4. Probabilistic representation of SPDEs 
In this section, we derive probabilistic representations for SPDEs of the form

$$-d\psi(\omega, t, x) = \mathcal{L} \psi(\omega, t, x)\,dt + f(\omega, t, x)\,dt + \left( g(\omega, t, x) + G(\omega, t, x) \psi(\omega, t, x) \right) d\overleftarrow{B}_t \tag{9}$$
$$\psi(\omega, T, x) = \varphi(\omega, x)$$

where $\psi : \Omega \times [0, T] \times \mathbb{R}^m \to \mathbb{R}$, $f : \Omega \times [0, T] \times \mathbb{R}^m \to \mathbb{R}$, $g : \Omega \times [0, T] \times \mathbb{R}^m \to \mathbb{R}^{1 \times d}$, and
$G : \Omega \times [0, T] \times \mathbb{R}^m \to \mathbb{R}^{1 \times d}$, $\varphi : \Omega \times \mathbb{R}^m \to \mathbb{R}$ are all jointly measurable, and $(B_t : t \in [0, T])$
is a $d$-dimensional standard Brownian motion under the measure $P$. Equation (9) represents
the general form of the equations (7) and (8) for the corrector $\psi^1_t(x, z)$ and error $R_t(x, z)$,
respectively. The differential operator $\mathcal{L}$ is given by

$$\mathcal{L} = \sum_{i=1}^m b_i(x) \frac{\partial}{\partial x_i} + \frac{1}{2} \sum_{i,j=1}^m a_{ij}(x) \frac{\partial^2}{\partial x_i \partial x_j}$$

for measurable $b : \mathbb{R}^m \to \mathbb{R}^m$ and $a : \mathbb{R}^m \to \mathbb{S}^{m \times m}$ ($\mathbb{S}^{m \times m}$ denotes positive semidefinite symmetric
matrices). We will represent these equations in terms of BDSDEs as introduced by
Pardoux and Peng (1994). Note that for these linear equations it is possible to give a
Feynman-Kac type representation without using BDSDEs. This is done, for example, in Rozovskii (1990)
("The Method of Stochastic Characteristics"). However the BDSDE-representation has the
advantage that it permits us to apply Gronwall's lemma. This would not be possible with the
method of stochastic characteristics.

A BDSDE is an integral equation of the form

$$Y_t = \xi + \int_t^T f(s, Y_s, Z_s)\,ds + \int_t^T g(s, Y_s, Z_s)\, d\overleftarrow{B}_s - \int_t^T Z_s\, dW_s$$

where $B$ and $W$ are independent Brownian motions. The solution $(Y_t, Z_t)$ will be $\mathcal{F}^W_{0,t} \vee \mathcal{F}^B_{t,T}$-measurable.
Starting from the notion of BDSDEs, we can define forward-backward doubly
stochastic differential equations. Let $\sigma = a^{1/2}$ and

$$X^{t,x}_s = x + \int_t^s b(X^{t,x}_r)\,dr + \int_t^s \sigma(X^{t,x}_r)\,dW_r \quad \text{for } s \ge t,$$
$$X^{t,x}_s = x \quad \text{for } s \le t.$$

We then define the following BDSDE

$$-dY^{t,x}_s = f(s, X^{t,x}_s)\,ds + \left( g(s, X^{t,x}_s) + G(s, X^{t,x}_s) Y^{t,x}_s \right) d\overleftarrow{B}_s - Z^{t,x}_s\, dW_s.$$

It turns out that this gives a finite-dimensional probabilistic representation for equation (9). This
is not completely covered by Pardoux and Peng (1994), because we have random unbounded
coefficients, and because we do not assume the diffusion matrix $a$ to have a smooth square root.
On the other hand, the equation is of a particularly simple linear type. In the remainder of this
section, we give the precise statement and proof for this representation. This can be skipped at
first reading.

We will not be able to get an existence result for classical solutions of the above SPDE from 
the theory of BDSDEs: This is due to the fact that for this we would need smoothness properties 
of a square root of a. But even when a is smooth, in the degenerate elliptic case it does not 
need to have a smooth square root (see, for example, Stroock (2008), Chapter 2.3). Instead 
we will use the existence result of Rozovskii (1990) and only reprove the uniqueness result of 
Pardoux and Peng (1994) in our setting. This will work under Lipschitz continuity of $a^{1/2}$.
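
A one-dimensional illustration of the phenomenon may help. The coefficient below is a hypothetical example chosen here, not one taken from the paper: it is smooth, bounded with bounded derivatives, and degenerate at the origin, and its square root is Lipschitz but not differentiable.

```latex
% Illustrative degenerate coefficient for m = 1 (an assumption of this example):
\[
  a(x) = \frac{x^{2}}{1+x^{2}}, \qquad a \ge 0, \qquad a(0) = 0 .
\]
% Its nonnegative square root is
\[
  a^{1/2}(x) = \frac{|x|}{\sqrt{1+x^{2}}} ,
\]
% which has one-sided derivatives +1 and -1 at x = 0, so it is not differentiable there.
% Away from the origin,
\[
  \Bigl| \frac{d}{dx}\, a^{1/2}(x) \Bigr| = \frac{1}{(1+x^{2})^{3/2}} \le 1
  \qquad (x \neq 0),
\]
% so a^{1/2} is Lipschitz continuous with constant 1.
```

This is the situation the argument is designed for: Lipschitz continuity of the square root is enough for the forward SDE to have a strong solution, even though no smooth square root exists.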

Define for $0 \le t \le s \le T$

$$\mathcal{F}^{B,0}_{t,s} = \sigma(B_u - B_t : t \le u \le s)$$

and $\mathcal{F}^B_{t,s}$ as the completion of $\mathcal{F}^{B,0}_{t,s}$ under $P$. Introduce the space of adapted random fields of
polynomial growth:

Definition. $\mathcal{P}_T(\mathbb{R}^m, \mathbb{R}^n)$ is the space of random fields

$$H : \Omega \times [0, T] \times \mathbb{R}^m \to \mathbb{R}^n$$

that are jointly measurable in $(\omega, t, x)$, such that for fixed $(t, x)$, $\omega \mapsto H(\omega, t, x)$ is $\mathcal{F}^B_{t,T}$-measurable.
Further for fixed $\omega$ outside a null set, $H$ has to be jointly continuous in $(t, x)$, and it has to satisfy
the following inequality: For every $p \ge 1$ there are $C_p > 0$, $q > 0$, such that for all $x \in \mathbb{R}^m$

$$E\left[ \sup_{0 \le t \le T} |H(t, x)|^p \right] \le C_p (1 + |x|^q).$$

We make the following assumptions on the coefficients of the SPDE:

$(S_k)$ $f$ and $g$ are $k$ times continuously differentiable and the partial derivatives up to order $k$
are all in $\mathcal{P}_T$. $G$ is $(k + 1)$ times continuously differentiable and the partial derivatives
up to order $(k + 1)$ are all uniformly bounded in $(\omega, t, x)$. $\varphi$ is $k$ times continuously
differentiable, and all partial derivatives of order 0 to $k$ grow at most polynomially.

We make the following assumptions on the coefficients of the differential operator $\mathcal{L}$:

$(D_k)$ $b \in C_b^k(\mathbb{R}^m, \mathbb{R}^m)$, $a \in C_b^k(\mathbb{R}^m, \mathbb{S}^{m \times m})$, and $a$ is degenerate elliptic: For every $\xi \in \mathbb{R}^m$ and
every $x \in \mathbb{R}^m$,

$$\sum_{i,j=1}^m a_{ij}(x)\, \xi_i \xi_j \ge 0.$$

Then we have the following result:

Proposition 4.1. Assume $(S_k)$ and $(D_k)$ for some $k \ge 3$. Then the equation (9) has a unique
classical solution $\psi$ in the sense that for every fixed $\omega$ outside a null set, $\psi(\omega, \cdot, \cdot) \in C^{0,k-1}([0, T] \times \mathbb{R}^m, \mathbb{R})$,
$\psi$ and its partial derivatives are in $\mathcal{P}_T(\mathbb{R}^m, \mathbb{R})$, and $\psi$ solves the integral equation. If $\tilde\psi$
is any other solution of the integral equation, then $\psi$ and $\tilde\psi$ are indistinguishable. If further $f$, $g$
and $\varphi$ as well as their derivatives up to order $k$ are uniformly bounded in $(\omega, t, x)$, then for any
$p \ge 1$ there exist $C_p, q > 0$ (only depending on $p$, the dimensions involved, the bounds on $a$, $b$
and $G$, and on $T$), such that for all $|\alpha| \le k - 1$ and $x \in \mathbb{R}^m$:

$$E\left[ \sup_{t \le T} |D^\alpha \psi(t, x)|^p \right] \le C_p (1 + |x|^q)\, E\left[ \|\varphi\|^p_{k,\infty} + \sup_{t \le T} \|f(t, \cdot)\|^p_{k,\infty} + \sup_{t \le T} \|g(t, \cdot)\|^p_{k,\infty} \right].$$

Proof. This is a combination of Theorem 4.3.2 and Corollary 4.3.2 of Rozovskii (1990). (The
claimed bound is only given for the equation in unweighted Sobolev spaces, in Corollary 4.2.2.
But from that we can deduce the result for the weighted Sobolev case.) The only thing we need
to verify is that our polynomial growth assumption on the coefficients is compatible with the
Sobolev norm condition there. But if $\theta \in \mathcal{P}_T(\mathbb{R}^m, \mathbb{R}^n)$, then for any $p \ge 1$ there certainly is an
$r < 0$ such that $\theta$ takes its values in the weighted $L^p$-space with weight $(1 + |x|^2)^{r/2}$:

$$E\left[ \sup_{0 \le t \le T} \int |\theta(t, x)|^p (1 + |x|^2)^{r/2}\, dx \right] \le E\left[ \int \sup_{0 \le t \le T} |\theta(t, x)|^p (1 + |x|^2)^{r/2}\, dx \right]$$
$$= \int E\left[ \sup_{0 \le t \le T} |\theta(t, x)|^p \right] (1 + |x|^2)^{r/2}\, dx \le \int C_p (1 + |x|^q)(1 + |x|^2)^{r/2}\, dx < \infty$$

for small enough $r$, that is, for $r$ sufficiently negative. □



Now we combine this result with the theory of BDSDEs:

Let $(W_t : t \in [0, T])$ be an $n$-dimensional standard Brownian motion that is independent of
$B$. For $0 \le t \le s$, $\mathcal{F}^W_{t,s}$ is defined analogously to $\mathcal{F}^B_{t,s}$. For $0 \le t \le T$ we set

$$\mathcal{F}_t = \mathcal{F}^W_{0,t} \vee \mathcal{F}^B_{t,T}.$$

Note that this is not a filtration, as it is neither decreasing nor increasing in $t$. Introduce the
following notation:

• $H^2_T(\mathbb{R}^m)$ is the space of measurable $\mathbb{R}^m$-valued processes $Y$ s.t. $Y_t$ is $\mathcal{F}_t$-measurable and

$$E\left[ \int_0^T |Y_t|^2\, dt \right] < \infty.$$

• $S^2_T(\mathbb{R}^m)$ is the space of continuous adapted $\mathbb{R}^m$-valued processes $Y$ s.t. $Y_t$ is $\mathcal{F}_t$-measurable and

$$E\left[ \sup_{0 \le t \le T} |Y_t|^2 \right] < \infty.$$



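A BDSDE is an integral equation of the form

$$Y_t = \xi + \int_t^T f(s, Y_s, Z_s)\,ds + \int_t^T g(s, Y_s, Z_s)\, d\overleftarrow{B}_s - \int_t^T Z_s\, dW_s. \tag{10}$$

$(Y, Z)$ will be called a solution of (10) if $(Y, Z) \in S^2_T(\mathbb{R}) \times H^2_T(\mathbb{R}^{1 \times n})$ and if the couple solves
the integral equation. We will also write the equation in differential form:

$$-dY_t = f(t, Y_t, Z_t)\,dt + g(t, Y_t, Z_t)\, d\overleftarrow{B}_t - Z_t\, dW_t.$$

Pardoux and Peng (1994) show that under the following conditions, equation (10) has a unique
solution:

• for any $(y, z) \in \mathbb{R} \times \mathbb{R}^{1 \times n}$, $f(\cdot, \cdot, y, z) \in H^2_T(\mathbb{R})$ and $g(\cdot, \cdot, y, z) \in H^2_T(\mathbb{R}^{1 \times d})$;

• $f$ and $g$ satisfy Lipschitz conditions and $g$ is a contraction in $z$: there exist $L > 0$ and
$0 < \alpha < 1$ s.t. for any $(\omega, t)$ and $y_1, y_2, z_1, z_2$:

$$|f(t, \omega, y_1, z_1) - f(t, \omega, y_2, z_2)|^2 \le L \left( |y_1 - y_2|^2 + |z_1 - z_2|^2 \right) \quad \text{and}$$
$$|g(t, \omega, y_1, z_1) - g(t, \omega, y_2, z_2)|^2 \le L |y_1 - y_2|^2 + \alpha |z_1 - z_2|^2.$$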

Now we want to associate a diffusion $X$ to the differential operator $\mathcal{L}$. To do so, assume that
$(D_k)$ is satisfied for some $k \ge 2$. Then $\sigma := a^{1/2}$ is Lipschitz continuous by Lemma 2.3.3 of
Stroock (2008). Hence for every $(t, x) \in [0, T] \times \mathbb{R}^m$, there exists a strong solution of the SDE

$$X^{t,x}_s = x + \int_t^s b(X^{t,x}_r)\,dr + \int_t^s \sigma(X^{t,x}_r)\,dW_r \quad \text{for } s \ge t,$$
$$X^{t,x}_s = x \quad \text{for } s \le t.$$

Associate the following BDSDE to (9):

$$-dY^{t,x}_s = f(s, X^{t,x}_s)\,ds + \left( g(s, X^{t,x}_s) + G(s, X^{t,x}_s) Y^{t,x}_s \right) d\overleftarrow{B}_s - Z^{t,x}_s\, dW_s, \tag{11}$$
$$Y^{t,x}_T = \varphi(X^{t,x}_T).$$

Under the assumptions $(S_k)$ and $(D_k)$ for $k \ge 2$, this equation has a unique solution.

Proposition 4.2. Assume $(S_k)$ and $(D_k)$ for some $k \ge 3$. Then the unique classical solution
$\psi$ of the SPDE (9) is given by $\psi(t, x) = Y^{t,x}_t$, where $(Y^{t,x}, Z^{t,x})$ is the unique solution of the
BDSDE (11).

We can give exactly the same proof as in Pardoux and Peng (1994), Theorem 3.1, taking 
advantage of the independence of B and W . For the reader's convenience, we include it here. 

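Proof. Let $\psi$ be a classical solution of (9). It suffices to show that

$$\left( \psi(s, X^{t,x}_s),\; D\psi(s, X^{t,x}_s)\, \sigma(X^{t,x}_s) : t \le s \le T \right)$$

solves the BDSDE (11). Here $D\psi$ is the gradient of $\psi$. For this purpose, consider a partition
$t = t_0 < t_1 < \dots < t_n = T$ of $[t, T]$. Then

$$\psi(t, x) - \psi(T, X^{t,x}_T) = \sum_{i=0}^{n-1} \left( \psi(t_i, X^{t,x}_{t_i}) - \psi(t_{i+1}, X^{t,x}_{t_{i+1}}) \right)$$

and

$$\psi(t_i, X^{t,x}_{t_i}) - \psi(t_{i+1}, X^{t,x}_{t_{i+1}}) = \left[ \psi(t_i, X^{t,x}_{t_i}) - \psi(t_i, X^{t,x}_{t_{i+1}}) \right] + \left[ \psi(t_i, X^{t,x}_{t_{i+1}}) - \psi(t_{i+1}, X^{t,x}_{t_{i+1}}) \right]$$
$$= -\int_{t_i}^{t_{i+1}} \mathcal{L}\psi(t_i, X^{t,x}_s)\,ds - \int_{t_i}^{t_{i+1}} D\psi(t_i, X^{t,x}_s)\, \sigma(X^{t,x}_s)\,dW_s$$
$$+ \int_{t_i}^{t_{i+1}} \left( \mathcal{L}\psi(s, X^{t,x}_{t_{i+1}}) + f(s, X^{t,x}_{t_{i+1}}) \right) ds$$
$$+ \int_{t_i}^{t_{i+1}} \left( g(s, X^{t,x}_{t_{i+1}}) + G(s, X^{t,x}_{t_{i+1}})\, \psi(s, X^{t,x}_{t_{i+1}}) \right) d\overleftarrow{B}_s.$$

This is justified because $X^{t,x}$ and $B$ are independent and because $\psi$ grows polynomially, hence
we can apply Itô's formula. We also used the fact that $\psi$ is a classical solution to (9). If we let
the mesh size tend to 0, then by continuity of $X^{t,x}$ and $\psi$, the result follows. □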



5. Preliminary estimates 



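The notation $D^\alpha_x$ indicates that the differential operator $D^\alpha$ is only acting on the $x$-variables.
The following result will help us to justify the BDSDE-representations on the deeper levels.
Recall that $p_t(z, \theta; x) = E[\theta(x, Z^x_t) \mid Z^x_0 = z]$.

Proposition 5.1. Assume $(HF_{k,l})$. Let $\theta \in C^{k,l}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$ satisfy for some $C, p > 0$

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le l} |D^\alpha_x D^\beta_z \theta(x, z)| \le C (1 + |x|^p + |z|^p).$$

Then

$$(t, x, z) \mapsto p_t(z, \theta; x) \in C^{0,k,l}(\mathbb{R}_+ \times \mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$$

and there exist $C_1, p_1 > 0$, such that for all $(t, x, z) \in [0, \infty) \times \mathbb{R}^m \times \mathbb{R}^n$

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le l} |D^\alpha_x D^\beta_z p_t(z, \theta; x)| \le C_1 e^{C_1 t} (1 + |x|^{p_1} + |z|^{p_1}).$$

If the bound on the derivatives of $\theta$ can be chosen uniformly in $x$, i.e.

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le l} \sup_x |D^\alpha_x D^\beta_z \theta(x, z)| \le C (1 + |z|^p),$$

then the bound on the derivatives of $p_t(z, \theta; x)$ is also uniform in $x$:

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le l} \sup_x |D^\alpha_x D^\beta_z p_t(z, \theta; x)| \le C_1 e^{C_1 t} (1 + |z|^{p_1}).$$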
Proof. Note that $p_t(z, \theta; x) = E[\theta(x, Z^x_t) \mid Z^x_0 = z] = E[\theta(X_t, Z_t) \mid (X_0, Z_0) = (x, z)]$, where

$$X_t = X_0,$$
$$Z_t = Z_0 + \int_0^t f(X_s, Z_s)\,ds + \int_0^t g(X_s, Z_s)\,dW_s.$$

In this formulation, the first result is standard; see, for example, Stroock (2008), Corollary 2.2.8.

The second statement can be proven in the same way as Stroock (2008), Corollary 2.2.8. □

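In the following Proposition we collect some results from Pardoux and Veretennikov (2003):

Proposition 5.2. Assume (Hstat) and $(HF_{k,l})$. Let $\theta \in C^{k,0}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$ satisfy for some
$C, p > 0$:

$$\sum_{|\alpha| \le k} \sup_x |D^\alpha_x \theta(x, z)| \le C (1 + |z|^p).$$

Then

(1) $x \mapsto p_\infty(\theta; x) \in C_b^k(\mathbb{R}^m, \mathbb{R})$.

(2) Assume additionally that $\theta$ satisfies the centering condition

$$\int \theta(x, z)\, p_\infty(x, dz) = 0$$

for all $x$, and that $\theta \in C^{k,1}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$ and

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le 1} \sup_x |D^\alpha_x D^\beta_z \theta(x, z)| \le C (1 + |z|^p).$$

Then

$$(x, z) \mapsto \int_0^\infty p_t(z, \theta; x)\,dt \in C^{k,1}(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R}),$$

and for every $q > 0$ there exist $C_1, q_1 > 0$, such that for every $z \in \mathbb{R}^n$

$$\sum_{|\alpha| \le k-1} \sum_{|\beta| \le 1} \sup_x \int_0^\infty |D^\alpha_x D^\beta_z p_t(z, \theta; x)|^q\, dt \le C_1 (1 + |z|^{q_1}).$$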

Proof. The statements are taken from Theorem 1, Theorem 2 and Proposition 1 of Pardoux and Veretennikov (2003):

(1) We get from Theorem 1 of Pardoux and Veretennikov (2003) that for any $q > 0$ there exists $C_q > 0$ such that for any $(x,z,z') \in \mathbb{R}^m \times \mathbb{R}^n \times \mathbb{R}^n$

$$\sum_{|\alpha| \le k} \sup_x |D_x^\alpha p_\infty(z';x)| \le \frac{C_q}{1 + |z'|^q}.$$

So if we choose $q$ large enough and differentiate $p_\infty(\theta;x)$ under the integral sign, then we obtain the first claim. (Of course here we have to use the growth constraint on $\theta$ and its derivatives.)

(2) This follows from the bounds on the derivatives of $p_t(z,\theta;x)$ that are given in Pardoux and Veretennikov (2003), Theorem 2, formulae (14) and (15): For any $k > 0$ there exist $C_k, m_k > 0$ such that for any $(t,x,z) \in [1,\infty) \times \mathbb{R}^m \times \mathbb{R}^n$

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le 1} |D_z^\beta D_x^\alpha p_t(z,\theta;x)| \le C_k \frac{1 + |z|^{m_k}}{(1+t)^k}.$$

We combine this estimate with Proposition 5.1, from where we obtain for $(t,x,z) \in \mathbb{R}_+ \times \mathbb{R}^m \times \mathbb{R}^n$

$$\sum_{|\alpha| \le k} \sum_{|\beta| \le 1} |D_x^\alpha D_z^\beta p_t(z,\theta;x)| \le C_1 e^{C_1 t}(1 + |z|^{p_1}).$$

We choose $k$ such that $qk > 1$ and use the first estimate on $[1,\infty)$ and the second estimate on $[0,1)$. The result follows.

□ 
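As a sanity check on the integrability claim in Proposition 5.2 (2), the following sketch treats a toy case that is not taken from the paper: a one-dimensional Ornstein-Uhlenbeck fast process $dZ_t = -Z_t\,dt + \sqrt{2}\,dW_t$ (with no dependence on $x$) and the centered test function $\theta(z) = z$. In this case $p_t(z,\theta) = z e^{-t}$ in closed form, so $\int_0^\infty p_t(z,\theta)\,dt = z$, which grows polynomially in $z$ as the proposition predicts. The function names and quadrature parameters are illustrative choices.

```python
import math


def p_t(t: float, z: float) -> float:
    """E[Z_t | Z_0 = z] for the toy OU process dZ = -Z dt + sqrt(2) dW."""
    return z * math.exp(-t)


def integrated_semigroup(z: float, horizon: float = 40.0, steps: int = 40_000) -> float:
    """Trapezoidal approximation of the integral of p_t(z) over [0, horizon]."""
    dt = horizon / steps
    total = 0.5 * (p_t(0.0, z) + p_t(horizon, z))
    for i in range(1, steps):
        total += p_t(i * dt, z)
    return total * dt


if __name__ == "__main__":
    for z in (0.5, 1.0, 2.0):
        # The numerical value should be close to z itself.
        print(z, integrated_semigroup(z))
```

The truncation at a finite horizon is harmless here because the integrand decays exponentially; for a general fast process the decay is only the polynomial rate quoted in the proof.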

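We will also need some moment bounds for the diffusions $X^\varepsilon$ and $Z^\varepsilon$.

Proposition 5.3. Assume $(H_{stat})$ and that the coefficients $b$, $\sigma$ of the slow motion and $f$, $g$ of the fast motion are bounded and globally Lipschitz continuous. Then for any $p \ge 1$ there exists $C_p > 0$ such that

$$\sup_{(t,\varepsilon,x) \in [0,\infty) \times [0,1] \times \mathbb{R}^m} E[|Z^\varepsilon_t|^p \mid (X^\varepsilon_0, Z^\varepsilon_0) = (x,z)] \le C_p(1 + |z|^p).$$

Also, for every $T > 0$ and every $p \ge 1$ there exist $C(p,T), q > 0$ such that

$$\sup_{(t,\varepsilon) \in [0,T] \times [0,1]} E[|X^\varepsilon_t|^p \mid (X^\varepsilon_0, Z^\varepsilon_0) = (x,z)] \le C(p,T)(1 + |x|^q).$$

Proof. The first claim can be proven exactly as in Veretennikov (1997): First write $\tilde Z^\varepsilon_t := Z^\varepsilon_{\varepsilon t}$. Then

$$d\tilde Z^\varepsilon_t = f(X^\varepsilon_{\varepsilon t}, \tilde Z^\varepsilon_t)\,dt + g(X^\varepsilon_{\varepsilon t}, \tilde Z^\varepsilon_t)\,d\tilde W^\varepsilon_t,$$

where $\tilde W^\varepsilon_t := \varepsilon^{-1/2} W_{\varepsilon t}$ is a Wiener process. Next, introduce the same time change as in Pardoux and Veretennikov (2001), page 1063: $\kappa(x,z) := |g(x,z)^* z| / |z|$, $\gamma^\varepsilon(t) := \int_0^t \kappa^2(X^\varepsilon_{\varepsilon s}, \tilde Z^\varepsilon_s)\,ds$, $\tau^\varepsilon(t) := (\gamma^\varepsilon)^{-1}(t)$. Define $\bar Z^\varepsilon_t := \tilde Z^\varepsilon_{\tau^\varepsilon(t)}$. Then

$$d\bar Z^\varepsilon_t = \kappa^{-2}(X^\varepsilon_{\varepsilon\tau^\varepsilon(t)}, \bar Z^\varepsilon_t) f(X^\varepsilon_{\varepsilon\tau^\varepsilon(t)}, \bar Z^\varepsilon_t)\,dt + \kappa^{-1}(X^\varepsilon_{\varepsilon\tau^\varepsilon(t)}, \bar Z^\varepsilon_t) g(X^\varepsilon_{\varepsilon\tau^\varepsilon(t)}, \bar Z^\varepsilon_t)\,d\bar W^\varepsilon_t$$

with a new standard Brownian motion $\bar W^\varepsilon$. Now we are in a position to just copy the proof of Lemma 1 in Veretennikov (1997) (which we do not do here) to get the first result.

The second claim is obvious, because the coefficients of $X^\varepsilon$ are bounded. □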

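Now we are able to impose conditions on the coefficients of the diffusions that guarantee smoothness of the coefficients of $\bar{\mathcal{L}}$. Recall that $\bar{\mathcal{L}}$ was defined as

$$\bar{\mathcal{L}} = \sum_{i=1}^m \bar b_i(x) \frac{\partial}{\partial x_i} + \frac{1}{2} \sum_{i,j=1}^m \bar a_{ij}(x) \frac{\partial^2}{\partial x_i \partial x_j},$$

where $\bar b = p_\infty(b;x)$ and $\bar a = p_\infty(\sigma\sigma^*;x)$.

Proposition 5.4. Assume $(HF_{k,3})$, $(HS_{k,0})$, and $(HO_{k,0})$. Then

$$\bar b \in C_b^k(\mathbb{R}^m, \mathbb{R}^m), \quad \bar a \in C_b^k(\mathbb{R}^m, S^{m \times m}), \quad \bar h \in C_b^k(\mathbb{R}^m, \mathbb{R}^d).$$

Proof. All the terms of $\bar b$, $\bar a$ and $\bar h$ are of the form $p_\infty(\theta;x)$. So by Proposition 5.2, we only need to verify that the respective $\theta$ are in $C^{k,0}$ and satisfy the polynomial bound

$$\sum_{|\alpha| \le k} \sup_x |D_x^\alpha \theta(x,z)| \le C(1 + |z|^p)$$

for some $C, p > 0$. But we even assumed them to be in $C_b^{k,0}$, so the result follows. □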
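The averaged coefficients in Proposition 5.4 are integrals of the original coefficients against the invariant measure of the fast motion. The following minimal sketch makes this concrete under assumptions chosen only for illustration: the invariant measure is taken to be the standard normal law for every $x$, and the slow drift is the toy function $b(x,z) = -x + \cos z$, for which $\bar b(x) = -x + e^{-1/2}$. All names are hypothetical.

```python
import math


def b(x: float, z: float) -> float:
    """Toy slow drift (an assumption of this sketch, not the paper's model)."""
    return -x + math.cos(z)


def invariant_density(z: float) -> float:
    """Standard normal density, assumed to be the invariant law for every x."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def averaged(coeff, x: float, z_max: float = 10.0, steps: int = 4_000) -> float:
    """Approximate p_infty(coeff; x), the integral of coeff(x, z) against the invariant law."""
    dz = 2.0 * z_max / steps
    total = 0.0
    for i in range(steps + 1):
        z = -z_max + i * dz
        weight = 0.5 if i in (0, steps) else 1.0  # trapezoidal end-point weights
        total += weight * coeff(x, z) * invariant_density(z)
    return total * dz


if __name__ == "__main__":
    print(averaged(b, 1.0), -1.0 + math.exp(-0.5))
```

Since the averaged drift inherits its $x$-dependence from $b$ alone in this toy case, its smoothness in $x$ is immediate; in general the invariant measure also depends on $x$, which is why Proposition 5.2 is needed.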



6. Proof of the main result

We will find convergence rates for the corrector and remainder terms that are expressed in terms of $v^0$ and its derivatives. So now we give bounds on $v^0$ and its derivatives in terms of the test function $\varphi$. This is necessary because we want to show convergence of the filter not only when integrating fixed test functions, but with respect to a suitable distance on the space of probability measures.

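Lemma 6.1. Let $k \ge 2$ and assume $\bar b, \bar a, \varphi \in C_b^{k+1}$ and $\bar h \in C_b^{k+2}$. Then $v^0 \in C^{0,k}([0,T] \times \mathbb{R}^m)$, and for any $p \ge 1$ there exist $C_p, q > 0$, independent of $\varphi$, such that for all $x \in \mathbb{R}^m$:

$$\sum_{|\alpha| \le k} E\Big[\sup_{0 \le t \le T} |D_x^\alpha v^0_t(x)|^p\Big] \le C_p (1 + |x|^q) \|\varphi\|^p_{k,\infty}.$$

In particular, $v^0$ and all its partial derivatives up to order $(0,k)$ are in $\mathcal{P}_T(\mathbb{R}^m, \mathbb{R})$.

Proof. This is a simple application of Proposition 4.1, noting that the equation (6) for $v^0$ is of the type (9) with $f = 0$, $g = 0$, and $G = \bar h^*$. □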



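We will prove $L^p$-convergence of $\psi^1$ and $R$ separately:

Lemma 6.2. Let $k, l \ge 2$. Assume $(H_{stat})$, $(HF_{k+1,l+1})$, $(HS_{k+1,l+1})$, and $(HO_{k+1,l+1})$. Also assume $v^0 \in C^{0,k+1}([0,T] \times \mathbb{R}^m, \mathbb{R})$, and that all its partial derivatives in $x$ up to order $k+1$ are in $\mathcal{P}_T(\mathbb{R}^m, \mathbb{R})$. Finally assume $\bar a, \bar b, \bar h \in C_b^k$. Then $\psi^1 \in C^{0,k,l}([0,T] \times \mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$, and $\psi^1$ as well as its partial derivatives up to order $(0,k,l)$ are in $\mathcal{P}_T(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$. For any $p \ge 1$ there exist $C_p, q > 0$, independent of $\varphi$, such that for any $(x,z) \in \mathbb{R}^{m+n}$ and any $\varepsilon \in (0,1)$

$$\sum_{|\alpha| \le k-1} \sup_{0 \le t \le T} E[|D_x^\alpha \psi^1_t(x,z)|^p] \le \varepsilon^{p/2} C_p (1 + |z|^q) \sum_{|\alpha| \le k+1} E\Big[\sup_{0 \le t \le T} |D_x^\alpha v^0_t(x)|^p\Big].$$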

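Proof. $\psi^1_t(x,z)$ solves the BSPDE

$$-d\psi^1_t(x,z) = \Big[\frac{1}{\varepsilon} \mathcal{L}_F \psi^1_t(x,z) + (\mathcal{L}_S - \bar{\mathcal{L}}) v^0_t(x)\Big] dt + [h(x,z) - \bar h(x)]^* v^0_t(x)\,dB_t, \qquad \psi^1_T(x,z) = 0. \tag{12}$$

Existence of the solution $\psi^1$ and its derivatives as well as the polynomial growth all follow from Proposition 4.1. Write $Z^{\varepsilon,x,(t,z)}$ for the solution of the SDE

$$dZ^{\varepsilon,x,(t,z)}_s = \frac{1}{\varepsilon} f(x, Z^{\varepsilon,x,(t,z)}_s)\,ds + \frac{1}{\sqrt{\varepsilon}} g(x, Z^{\varepsilon,x,(t,z)}_s)\,dW_s, \quad s \ge t, \qquad Z^{\varepsilon,x,(t,z)}_s = z, \quad s \le t.$$

We consider $(x, Z^{\varepsilon,x,(t,z)})$ as a joint diffusion, just as in the proof of Proposition 5.1 ($x$ has generator 0). By Proposition 4.2, the solution of (12) is given by $\theta^{(t,x,z)(1)}_t$, where $\theta^{(t,x,z)(1)}$ is the unique solution to the BDSDE

$$-d\theta^{(t,x,z)(1)}_s = (\mathcal{L}_S(\cdot, Z^{\varepsilon,x,(t,z)}_s) - \bar{\mathcal{L}}) v^0_s(x)\,ds + (h(x, Z^{\varepsilon,x,(t,z)}_s) - \bar h(x))^* v^0_s(x)\,dB_s - \gamma_s\,dW_s, \qquad \theta^{(t,x,z)(1)}_T = 0.$$

We will drop superscripts $(t,x,z)$ for $\theta^{(t,x,z)(1)}$ and write $\theta^1$ instead. Similarly, we write $Z^{\varepsilon,x}_s$ instead of $Z^{\varepsilon,x,(t,z)}_s$. $\psi^1_t(x,z)$ is $\mathcal{F}^B_{t,T}$-measurable, hence so is $\theta^1_t$. Therefore

$$\theta^1_t = E[\theta^1_t \mid \mathcal{F}^B_{t,T}] = E\Big[\int_t^T (\mathcal{L}_S(\cdot, Z^{\varepsilon,x}_s) - \bar{\mathcal{L}}) v^0_s(x)\,ds \,\Big|\, \mathcal{F}^B_{t,T}\Big] + E\Big[\int_t^T [h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x)\,dB_s \,\Big|\, \mathcal{F}^B_{t,T}\Big] - E\Big[\int_t^T \gamma_s\,dW_s \,\Big|\, \mathcal{F}^B_{t,T}\Big].$$

$W$ and $B$ are independent, therefore $W$ is a Brownian motion in the large filtration $(\mathcal{F}^W_s \vee \mathcal{F}^B_{0,T} : s \in [0,T])$, hence $E[\int_t^T \gamma_s\,dW_s \mid \mathcal{F}^W_t \vee \mathcal{F}^B_{0,T}] = 0$, and by the tower property

$$E\Big[\int_t^T \gamma_s\,dW_s \,\Big|\, \mathcal{F}^B_{t,T}\Big] = 0.$$

$v^0_s(x)$ is $\mathcal{F}^B_{s,T}$-measurable and $\bar{\mathcal{L}}$ has deterministic coefficients. Thus

$$E\Big[\int_t^T \bar{\mathcal{L}} v^0_s(x)\,ds \,\Big|\, \mathcal{F}^B_{t,T}\Big] = \int_t^T E[\bar{\mathcal{L}} v^0_s(x) \mid \mathcal{F}^B_{t,T}]\,ds = \int_t^T \Big\{ \sum_{i=1}^m p_\infty(b_i;x) \frac{\partial}{\partial x_i} v^0_s(x) + \frac{1}{2} \sum_{i,j=1}^m p_\infty((\sigma\sigma^*)_{ij};x) \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big\}\,ds.$$

Since $Z^{\varepsilon,x}$ is independent of $B$,

$$\begin{aligned}
E\Big[\int_t^T \mathcal{L}_S(\cdot, Z^{\varepsilon,x}_s) v^0_s(x)\,ds \,\Big|\, \mathcal{F}^B_{t,T}\Big]
&= \int_t^T E[\mathcal{L}_S(\cdot, Z^{\varepsilon,x}_s) v^0_s(x) \mid \mathcal{F}^B_{t,T}]\,ds \\
&= \int_t^T \Big\{ \sum_{i=1}^m E[b_i(x, Z^{\varepsilon,x}_s)] \frac{\partial}{\partial x_i} v^0_s(x) + \frac{1}{2} \sum_{i,j=1}^m E[(\sigma\sigma^*)_{ij}(x, Z^{\varepsilon,x}_s)] \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big\}\,ds \\
&= \int_t^T \Big\{ \sum_{i=1}^m p_{(s-t)/\varepsilon}(z, b_i; x) \frac{\partial}{\partial x_i} v^0_s(x) + \frac{1}{2} \sum_{i,j=1}^m p_{(s-t)/\varepsilon}(z, (\sigma\sigma^*)_{ij}; x) \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big\}\,ds,
\end{aligned}$$

so

$$\begin{aligned}
\Big| E\Big[\int_t^T (\mathcal{L}_S - \bar{\mathcal{L}}) v^0_s(x)\,ds \,\Big|\, \mathcal{F}^B_{t,T}\Big] \Big|
&= \Big| \int_t^T \Big\{ \sum_{i=1}^m p_{(s-t)/\varepsilon}(z, b_i - p_\infty(b_i;x); x) \frac{\partial}{\partial x_i} v^0_s(x) \\
&\qquad\quad + \frac{1}{2} \sum_{i,j=1}^m p_{(s-t)/\varepsilon}(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x) \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big\}\,ds \Big|
\end{aligned}$$

($p_\infty(\cdot;x)$ does not depend on $z$, so it can be brought inside the integral $p_{(s-t)/\varepsilon}(z, \cdot; x)$)

$$\begin{aligned}
&= \varepsilon \Big| \int_0^{(T-t)/\varepsilon} \Big\{ \sum_{i=1}^m p_u(z, b_i - p_\infty(b_i;x); x) \frac{\partial}{\partial x_i} v^0_{\varepsilon u + t}(x) \\
&\qquad\quad + \frac{1}{2} \sum_{i,j=1}^m p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x) \frac{\partial^2}{\partial x_i \partial x_j} v^0_{\varepsilon u + t}(x) \Big\}\,du \Big| \\
&\le \varepsilon \sum_{i=1}^m \int_0^\infty |p_u(z, b_i - p_\infty(b_i;x); x)|\,du \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_i} v^0_s(x) \Big| \\
&\qquad + \frac{\varepsilon}{2} \sum_{i,j=1}^m \int_0^\infty |p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x)|\,du \sup_{t \le s \le T} \Big| \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big|
\end{aligned}$$

($f - p_\infty(f;x)$ is centered, so by Proposition 5.2, (2):)

$$\le \varepsilon C_1 (1 + |z|^{q_1}) \Big\{ \sum_{i=1}^m \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_i} v^0_s(x) \Big| + \sum_{i,j=1}^m \sup_{t \le s \le T} \Big| \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big| \Big\},$$

and therefore finally

$$E\Big[ \Big| E\Big[\int_t^T (\mathcal{L}_S - \bar{\mathcal{L}}) v^0_s(x)\,ds \,\Big|\, \mathcal{F}^B_{t,T}\Big] \Big|^p \Big] \le \varepsilon^p C_2 (1 + |z|^{q_2})\, E\Big[ \sum_{i=1}^m \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_i} v^0_s(x) \Big|^p + \sum_{i,j=1}^m \sup_{t \le s \le T} \Big| \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big|^p \Big]. \tag{13}$$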



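Next, using again that $v^0_s(x)$ is $\mathcal{F}^B_{s,T}$-measurable and that $Z^{\varepsilon,x}$ is independent of $B$,

$$\begin{aligned}
E\Big[\int_t^T [h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x)\,dB_s \,\Big|\, \mathcal{F}^B_{t,T}\Big]
&= \int_t^T E\big[[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x) \,\big|\, \mathcal{F}^B_{t,T}\big]\,dB_s \\
&= \int_t^T p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)\,dB_s.
\end{aligned}$$

For $t \le r \le T$, $r \mapsto \int_r^T p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)\,dB_s$ is a martingale w.r.t. $(\mathcal{F}^B_{r,T} : r \in [t,T])$ when time is run backwards. Hence by the Burkholder-Davis-Gundy inequality,

$$E\Big[ \Big| \int_t^T p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)\,dB_s \Big|^p \Big] \le C_p E\Big[ \Big( \int_t^T |p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)|^2\,ds \Big)^{p/2} \Big],$$

where

$$\begin{aligned}
\int_t^T |p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)|^2\,ds
&\le \varepsilon \int_0^{(T-t)/\varepsilon} |p_u(z, h - \bar h; x)|^2\,du \sup_{t \le s \le T} |v^0_s(x)|^2 \\
&\le \varepsilon \int_0^\infty |p_u(z, h - \bar h; x)|^2\,du \sup_{t \le s \le T} |v^0_s(x)|^2 \\
&\le \varepsilon C_3 (1 + |z|^{q_3}) \sup_{t \le s \le T} |v^0_s(x)|^2
\end{aligned}$$

(by Proposition 5.2, (2), since $h - \bar h$ is centered). Therefore,

$$E\Big[ \Big| \int_t^T p_{(s-t)/\varepsilon}(z, h - \bar h; x)^* v^0_s(x)\,dB_s \Big|^p \Big] \le \varepsilon^{p/2} C_4 (1 + |z|^{q_4})\, E\Big[ \sup_{t \le s \le T} |v^0_s(x)|^p \Big]. \tag{14}$$

Combining (13) and (14), and enlarging $C_4$ and $q_4$ if necessary,

$$E[|\psi^1_t(x,z)|^p] = E[|\theta^1_t|^p] \le \varepsilon^{p/2} C_4 (1 + |z|^{q_4}) \sum_{|\alpha| \le 2} E\Big[ \sup_{t \le s \le T} |D_x^\alpha v^0_s(x)|^p \Big].$$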



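Next, consider a first order $x$-derivative of $\theta^1$:

$$\frac{\partial}{\partial x_k} \theta^1_t = \frac{\partial}{\partial x_k} \int_t^T E[\mathcal{L}_S(\cdot, Z^{\varepsilon,x}_s) - \bar{\mathcal{L}}] v^0_s(x)\,ds + \frac{\partial}{\partial x_k} \int_t^T E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x)\,dB_s.$$

Interchanging order of differentiation and integration,

$$\begin{aligned}
\Big| \frac{\partial}{\partial x_k} \int_t^T E[\mathcal{L}_S - \bar{\mathcal{L}}] v^0_s(x)\,ds \Big|
&= \varepsilon \Big| \int_0^{(T-t)/\varepsilon} \Big\{ \sum_{i=1}^m \Big[ \frac{\partial}{\partial x_k} p_u(z, b_i - p_\infty(b_i;x); x) \frac{\partial}{\partial x_i} v^0_{\varepsilon u + t}(x) \\
&\qquad\qquad + p_u(z, b_i - p_\infty(b_i;x); x) \frac{\partial^2}{\partial x_k \partial x_i} v^0_{\varepsilon u + t}(x) \Big] \\
&\qquad + \frac{1}{2} \sum_{i,j=1}^m \Big[ \frac{\partial}{\partial x_k} p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x) \frac{\partial^2}{\partial x_i \partial x_j} v^0_{\varepsilon u + t}(x) \\
&\qquad\qquad + p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x) \frac{\partial^3}{\partial x_k \partial x_i \partial x_j} v^0_{\varepsilon u + t}(x) \Big] \Big\}\,du \Big| \\
&\le \varepsilon \sum_{i=1}^m \Big[ \int_0^\infty \Big| \frac{\partial}{\partial x_k} p_u(z, b_i - p_\infty(b_i;x); x) \Big|\,du \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_i} v^0_s(x) \Big| \\
&\qquad\qquad + \int_0^\infty |p_u(z, b_i - p_\infty(b_i;x); x)|\,du \sup_{t \le s \le T} \Big| \frac{\partial^2}{\partial x_k \partial x_i} v^0_s(x) \Big| \Big] \\
&\qquad + \frac{\varepsilon}{2} \sum_{i,j=1}^m \Big[ \int_0^\infty \Big| \frac{\partial}{\partial x_k} p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x) \Big|\,du \sup_{t \le s \le T} \Big| \frac{\partial^2}{\partial x_i \partial x_j} v^0_s(x) \Big| \\
&\qquad\qquad + \int_0^\infty |p_u(z, (\sigma\sigma^*)_{ij} - p_\infty((\sigma\sigma^*)_{ij};x); x)|\,du \sup_{t \le s \le T} \Big| \frac{\partial^3}{\partial x_k \partial x_i \partial x_j} v^0_s(x) \Big| \Big] \\
&\le \varepsilon C_5 (1 + |z|^{q_5}) \sum_{1 \le |\beta| \le 3} \sup_{t \le s \le T} |D_x^\beta v^0_s(x)|,
\end{aligned}$$

where the last step follows again from Proposition 5.2, (2). Taking expectation,

$$E\Big[ \Big| \frac{\partial}{\partial x_k} \int_t^T E[\mathcal{L}_S - \bar{\mathcal{L}}] v^0_s(x)\,ds \Big|^p \Big] \le \varepsilon^p C_6 (1 + |z|^{q_6}) \sum_{1 \le |\beta| \le 3} E\Big[ \sup_{t \le s \le T} |D_x^\beta v^0_s(x)|^p \Big]. \tag{15}$$

Next, by $(HO_{k,l})$, we can interchange the order of ordinary differentiation and stochastic integration (cf. Karandikar (1983)):

$$\begin{aligned}
E\Big[ \Big| \frac{\partial}{\partial x_k} \int_t^T E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x)\,dB_s \Big|^p \Big]
&= E\Big[ \Big| \int_t^T \frac{\partial}{\partial x_k} \big( E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x) \big)\,dB_s \Big|^p \Big] \\
&\le C_p E\Big[ \Big( \int_t^T \Big| \frac{\partial}{\partial x_k} \big( E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x) \big) \Big|^2\,ds \Big)^{p/2} \Big],
\end{aligned}$$

where

$$\begin{aligned}
\int_t^T \Big| \frac{\partial}{\partial x_k} \big( E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x) \big) \Big|^2\,ds
&\le 2\varepsilon \int_0^{(T-t)/\varepsilon} \Big| \frac{\partial}{\partial x_k} p_u(z, h - \bar h; x) \Big|^2 |v^0_{\varepsilon u + t}(x)|^2\,du \\
&\qquad + 2\varepsilon \int_0^{(T-t)/\varepsilon} |p_u(z, h - \bar h; x)|^2 \Big| \frac{\partial}{\partial x_k} v^0_{\varepsilon u + t}(x) \Big|^2\,du \\
&\le \varepsilon C_7 (1 + |z|^{q_7}) \Big\{ \sup_{t \le s \le T} |v^0_s(x)|^2 + \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_k} v^0_s(x) \Big|^2 \Big\}.
\end{aligned}$$

The last step follows once again from Proposition 5.2, (2). So

$$E\Big[ \Big| \frac{\partial}{\partial x_k} \int_t^T E[h(x, Z^{\varepsilon,x}_s) - \bar h(x)]^* v^0_s(x)\,dB_s \Big|^p \Big] \le \varepsilon^{p/2} C_8 (1 + |z|^{q_8}) \Big\{ E\Big[ \sup_{t \le s \le T} |v^0_s(x)|^p \Big] + E\Big[ \sup_{t \le s \le T} \Big| \frac{\partial}{\partial x_k} v^0_s(x) \Big|^p \Big] \Big\}. \tag{16}$$

Combining (15) and (16),

$$E\Big[ \Big| \frac{\partial}{\partial x_k} \theta^1_t \Big|^p \Big] \le \varepsilon^{p/2} C_9 (1 + |z|^{q_9}) \sum_{|\alpha| \le 3} E\Big[ \sup_{t \le s \le T} |D_x^\alpha v^0_s(x)|^p \Big].$$

Iterating these arguments for the higher order derivatives of $\theta^1$,

$$\sum_{|\alpha| \le k-1} E[|D_x^\alpha \theta^1_t|^p] \le \varepsilon^{p/2} C_{10} (1 + |z|^{q_{10}}) \sum_{|\alpha| \le k+1} E\Big[ \sup_{t \le s \le T} |D_x^\alpha v^0_s(x)|^p \Big].$$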



□ 
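The two different powers of $\varepsilon$ in this proof (order $\varepsilon$ for the drift term, order $\varepsilon^{1/2}$ for the stochastic integral) come from the change of variables $u = (s-t)/\varepsilon$. The following sketch illustrates this scaling in a toy case that is an assumption of the example and not part of the paper: $p_u(z) = z e^{-u}$, which is the semigroup of a one-dimensional Ornstein-Uhlenbeck process applied to $\theta(z) = z$. Both integrals are available in closed form, and the function names are hypothetical.

```python
import math


def drift_term(eps: float, z: float, t: float = 0.0, T: float = 1.0) -> float:
    """Integral over [t, T] of p_{(s-t)/eps}(z) ds for the toy p_u(z) = z * exp(-u)."""
    return eps * z * (1.0 - math.exp(-(T - t) / eps))


def quadratic_variation_term(eps: float, z: float, t: float = 0.0, T: float = 1.0) -> float:
    """Integral over [t, T] of |p_{(s-t)/eps}(z)|^2 ds for the same toy p_u."""
    return 0.5 * eps * z * z * (1.0 - math.exp(-2.0 * (T - t) / eps))


if __name__ == "__main__":
    for eps in (0.04, 0.01):
        # The drift term scales like eps, the square root of the bracket like sqrt(eps).
        print(eps, drift_term(eps, 1.0), math.sqrt(quadratic_variation_term(eps, 1.0)))
```

The square root of the second quantity is what enters the Burkholder-Davis-Gundy estimate, which is why the stochastic integral only contributes a factor $\varepsilon^{p/2}$ to the $p$-th moment.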



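Lemma 6.3. Let $k, l \ge 3$. Assume $(HF_{k,l})$, $(HS_{k,l})$, and $(HO_{k+1,l+1})$. Also assume $\psi^1 \in C^{0,k+2,l}([0,T] \times \mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$ and that all its partial derivatives up to order $(0,k+2,l)$ are in $\mathcal{P}_T(\mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$. Then for any $p \ge 1$ there exists $C_p > 0$, independent of $\varphi$, such that for any $(x,z) \in \mathbb{R}^{m+n}$, any $\varepsilon \in (0,1)$, and any $t \in [0,T]$

$$E[|R_t(x,z)|^p] \le C_p \sum_{|\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)} \Big]\,ds.$$

Proof. $R_t(x,z)$ solves the BSPDE

$$-dR_t(x,z) = (\mathcal{L}^\varepsilon R_t(x,z) + \mathcal{L}_S \psi^1_t(x,z))\,dt + h(x,z)^* (\psi^1_t(x,z) + R_t(x,z))\,dB_t, \qquad R_T(x,z) = 0. \tag{17}$$

Existence of the solution $R$ and its derivatives, as well as the polynomial growth all follow from Proposition 4.1. By Proposition 4.2, the solution of (17) is given by $\theta^{(t,x,z)(2)}_t$, where $\theta^{(t,x,z)(2)}$ is the solution to the BDSDE

$$\begin{aligned}
-d\theta^{(t,x,z)(2)}_s &= \mathcal{L}_S \psi^1_s(X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)\,ds + h(X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)^* \psi^1_s(X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)\,dB_s \\
&\quad + h(X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)^* \theta^{(t,x,z)(2)}_s\,dB_s - \gamma^{t,x,z}_s\,dW_s - \delta^{t,x,z}_s\,dV_s, \qquad \theta^{(t,x,z)(2)}_T = 0.
\end{aligned}$$

We will drop superscripts $(t,x,z)$ for $\theta^{(t,x,z)(2)}$, $(t,z)$ for $Z^{\varepsilon,(t,z)}$ and $(t,x)$ for $X^{\varepsilon,(t,x)}$. $R_t(x,z)$ is $\mathcal{F}^B_{t,T}$-measurable, hence so is $\theta^2_t$. As before, the stochastic integrals over $dV$ and $dW$ vanish when we take conditional expectation with respect to $\mathcal{F}^B_{t,T}$. Thus

$$\theta^2_t = E\Big[ \int_t^T \mathcal{L}_S \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,ds \,\Big|\, \mathcal{F}^B_{t,T} \Big] + E\Big[ \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \theta^2_s\,dB_s \,\Big|\, \mathcal{F}^B_{t,T} \Big] + E\Big[ \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,dB_s \,\Big|\, \mathcal{F}^B_{t,T} \Big]. \tag{18}$$

Consider each term separately:

$$\begin{aligned}
E\Big[ \Big| E\Big[ \int_t^T \mathcal{L}_S \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,ds \,\Big|\, \mathcal{F}^B_{t,T} \Big] \Big|^p \Big]
&\le E\Big[ \Big| \int_t^T \mathcal{L}_S \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,ds \Big|^p \Big] \\
&\le (T-t)^{p-1} E\Big[ \int_t^T \Big| \Big( \sum_{i=1}^m b_i(X^\varepsilon_s, Z^\varepsilon_s) \frac{\partial}{\partial x_i} + \frac{1}{2} \sum_{i,j=1}^m (\sigma\sigma^*)_{ij}(X^\varepsilon_s, Z^\varepsilon_s) \frac{\partial^2}{\partial x_i \partial x_j} \Big) \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s) \Big|^p\,ds \Big] \\
&\le C_1 \int_t^T E\Big[ \sum_{i=1}^m \Big| \frac{\partial}{\partial x_i} \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s) \Big|^p + \sum_{i,j=1}^m \Big| \frac{\partial^2}{\partial x_i \partial x_j} \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s) \Big|^p \Big]\,ds \\
&\le C_2 \sum_{1 \le |\alpha| \le 2} \int_t^T E[|D_x^\alpha \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^p]\,ds.
\end{aligned}$$

Note that $Z^\varepsilon_s$ and $X^\varepsilon_s$ are $\mathcal{F}^{V,W}_s$-measurable, $\psi^1_s$ is $\mathcal{F}^B_{s,T}$-measurable, and $B$ and $(V,W)$ are independent. Thus

$$E[|D_x^\alpha \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^p] = E\big[ E[|D_x^\alpha \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^p \mid \mathcal{F}^{V,W}_s] \big] = E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big],$$

so that

$$E\Big[ \Big| E\Big[ \int_t^T \mathcal{L}_S \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,ds \,\Big|\, \mathcal{F}^B_{t,T} \Big] \Big|^p \Big] \le C_2 \sum_{1 \le |\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big]\,ds. \tag{19}$$

Next, by Jensen's inequality, the tower property, and the Burkholder-Davis-Gundy inequality,

$$\begin{aligned}
E\Big[ \Big| E\Big[ \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,dB_s \,\Big|\, \mathcal{F}^B_{t,T} \Big] \Big|^p \Big]
&\le E\Big[ \Big| \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,dB_s \Big|^p \Big] \\
&\le C_p E\Big[ \Big( \int_t^T |h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big)^{p/2} \Big],
\end{aligned}$$

where by Hölder's inequality and the Cauchy-Schwarz inequality

$$\Big( \int_t^T |h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big)^{p/2} \le C_3 \int_t^T |h(X^\varepsilon_s, Z^\varepsilon_s)|^p |\psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)|^p\,ds.$$

So by the same arguments as for the first term,

$$E\Big[ \Big| E\Big[ \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \psi^1_s(X^\varepsilon_s, Z^\varepsilon_s)\,dB_s \,\Big|\, \mathcal{F}^B_{t,T} \Big] \Big|^p \Big] \le C_4 \int_t^T E\Big[ E[|\psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big]\,ds. \tag{20}$$

Finally, using Burkholder-Davis-Gundy in the second line, and Cauchy-Schwarz in the third line,

$$\begin{aligned}
E\Big[ \Big| E\Big[ \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \theta^2_s\,dB_s \,\Big|\, \mathcal{F}^B_{t,T} \Big] \Big|^p \Big]
&\le E\Big[ \Big| \int_t^T h(X^\varepsilon_s, Z^\varepsilon_s)^* \theta^2_s\,dB_s \Big|^p \Big] \\
&\le C_p E\Big[ \Big( \int_t^T |h(X^\varepsilon_s, Z^\varepsilon_s)^* \theta^2_s|^2\,ds \Big)^{p/2} \Big] \\
&\le C_p E\Big[ \Big( \int_t^T |h(X^\varepsilon_s, Z^\varepsilon_s)|^2 |\theta^2_s|^2\,ds \Big)^{p/2} \Big] \\
&\le C_5 \|h\|^p_\infty \int_t^T E[|\theta^2_s|^p]\,ds. \qquad (21)
\end{aligned}$$

Combining (18) with (19), (20), and (21),

$$E[|\theta^2_t|^p] \le C_6 \sum_{|\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big]\,ds + C_5 \|h\|^p_\infty \int_t^T E[|\theta^2_s|^p]\,ds.$$

By Gronwall,

$$\begin{aligned}
E[|\theta^2_t|^p] &\le C_6 e^{(T-t) C_5 \|h\|^p_\infty} \sum_{|\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big]\,ds \\
&\le C_7 \sum_{|\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^\varepsilon_s, Z^\varepsilon_s)} \Big]\,ds,
\end{aligned}$$

where we choose $C_7$ so that this holds for every $t \in [0,T]$ (replace $e^{(T-t) C_5 \|h\|^p_\infty}$ by $e^{T C_5 \|h\|^p_\infty}$). □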

Now we can collect all these results to obtain the first step towards Theorem 3.1.

Lemma 6.4. Assume $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, and that $\varphi \in C_b^7(\mathbb{R}^m, \mathbb{R})$. Then for every $p \ge 1$ there exist $C, q_1, q_2 > 0$, independent of $\varphi$, such that

$$\sup_{0 \le t \le T} E[|v^\varepsilon_t(x,z) - v^0_t(x)|^p] \le \varepsilon^{p/2} C (1 + |x|^{q_1} + |z|^{q_2}) \|\varphi\|^p_{4,\infty}.$$



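Proof of Theorem 3.1. We track the necessary conditions backward from Lemma 6.3.

1. For the solution $R$ given in Lemma 6.3 to exist and satisfy the stated bound, we need $(HF_{3,3})$, $(HS_{3,3})$, $(HO_{4,4})$, and $\psi^1 \in C^{0,5,3}([0,T] \times \mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$. The polynomial growth condition will be satisfied anyway.

2. For $\psi^1$ to be in $C^{0,5,3}([0,T] \times \mathbb{R}^m \times \mathbb{R}^n, \mathbb{R})$, we need $(H_{stat})$, $(HF_{6,4})$, $(HS_{6,4})$, $(HO_{6,4})$ and $\bar a, \bar b, \bar h \in C_b^5$. We also need $v^0 \in C^{0,6}([0,T] \times \mathbb{R}^m, \mathbb{R})$. Again, the polynomial growth condition will be satisfied.

3. For $v^0$ to be in $C^{0,6}([0,T] \times \mathbb{R}^m, \mathbb{R})$ we need $\bar a, \bar b, \varphi \in C_b^7$ and $\bar h \in C_b^8$.

4. For $\bar a, \bar b$ to be in $C_b^7$ we need $(HF_{7,3})$ as well as $(HS_{7,0})$ by Proposition 5.4. Similarly we need $(HF_{8,3})$ as well as $(HO_{8,0})$ for $\bar h$ to be in $C_b^8$.

5. So sufficient conditions are $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$. In that case we obtain from Lemma 6.1

$$\sum_{|\alpha| \le 4} E\Big[ \sup_{0 \le t \le T} |D_x^\alpha v^0_t(x)|^p \Big] \le C_1 (1 + |x|^{q_1}) \|\varphi\|^p_{4,\infty}. \tag{22}$$

From Lemma 6.2 we obtain

$$\sum_{|\alpha| \le 2} \sup_{0 \le t \le T} E[|D_x^\alpha \psi^1_t(x,z)|^p] \le \varepsilon^{p/2} C_2 (1 + |z|^{q_2}) \sum_{|\alpha| \le 4} E\Big[ \sup_{0 \le t \le T} |D_x^\alpha v^0_t(x)|^p \Big]. \tag{23}$$

From Lemma 6.3 we get

$$E[|R_t(x,z)|^p] \le C_3 \sum_{|\alpha| \le 2} \int_t^T E\Big[ E[|D_x^\alpha \psi^1_s(x',z')|^p] \big|_{(x',z') = (X^{\varepsilon,(t,x)}_s, Z^{\varepsilon,(t,z)}_s)} \Big]\,ds. \tag{24}$$

Combining (22), (23), (24), we get for any $t \in [0,T]$ (by the time-homogeneity of $X^\varepsilon$ and $Z^\varepsilon$)

$$E[|R_t(x,z)|^p] + E[|\psi^1_t(x,z)|^p] \le \varepsilon^{p/2} C_4 \Big( 1 + \sup_{0 \le s \le T} E\big[ |X^\varepsilon_s|^{q_1} + |Z^\varepsilon_s|^{q_2} \,\big|\, (X^\varepsilon_0, Z^\varepsilon_0) = (x,z) \big] \Big) \|\varphi\|^p_{4,\infty}. \tag{25}$$

From Proposition 5.3 we obtain

$$\sup_{0 \le s \le T} E\big[ |X^\varepsilon_s|^{q_1} + |Z^\varepsilon_s|^{q_2} \,\big|\, (X^\varepsilon_0, Z^\varepsilon_0) = (x,z) \big] \le C_5 (1 + |x|^{q_3} + |z|^{q_4}).$$

Noting that the right hand side in (25) does not depend on $t \in [0,T]$,

$$\sup_{0 \le t \le T} E[|R_t(x,z)|^p] + \sup_{0 \le t \le T} E[|\psi^1_t(x,z)|^p] \le \varepsilon^{p/2} C_6 (1 + |x|^{q_3} + |z|^{q_4}) \|\varphi\|^p_{4,\infty}.$$

Finally

$$\begin{aligned}
\sup_{0 \le t \le T} E[|v^\varepsilon_t(x,z) - v^0_t(x)|^p] &\le C_7 \Big( \sup_{0 \le t \le T} E[|R_t(x,z)|^p] + \sup_{0 \le t \le T} E[|\psi^1_t(x,z)|^p] \Big) \\
&\le \varepsilon^{p/2} C_8 (1 + |x|^{q_3} + |z|^{q_4}) \|\varphi\|^p_{4,\infty},
\end{aligned}$$

which completes the proof.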



□ 



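Now we recall that all the calculations up until now were under the changed measure $P^\varepsilon$. We only wrote $P$ and $B$ to facilitate the reading. So let us transfer the results to the original measure $Q$.

Lemma 6.5. Assume $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, and that $\varphi \in C_b^7(\mathbb{R}^m, \mathbb{R})$. Then for every $p \ge 1$ there exist $C, q_1, q_2 > 0$, independent of $\varphi$, such that

$$\sup_{0 \le t \le T} E_Q[|v^\varepsilon_t(x,z) - v^0_t(x)|^p] \le \varepsilon^{p/2} C (1 + |x|^{q_1} + |z|^{q_2}) \|\varphi\|^p_{4,\infty}.$$

Proof. This is a simple application of the Cauchy-Schwarz inequality in combination with Gronwall's lemma:

$$E_Q[|v^\varepsilon_t(x,z) - v^0_t(x)|^p] = E_{P^\varepsilon}\Big[ |v^\varepsilon_t(x,z) - v^0_t(x)|^p \frac{dQ}{dP^\varepsilon} \Big] \le E_{P^\varepsilon}\big[ |v^\varepsilon_t(x,z) - v^0_t(x)|^{2p} \big]^{1/2} E_{P^\varepsilon}\Big[ \Big( \frac{dQ}{dP^\varepsilon} \Big)^2 \Big]^{1/2},$$

so we see that the result is true by Lemma 6.4 as long as the second expectation is finite. Recall that we had defined the notation

$$\frac{dQ}{dP^\varepsilon}\Big|_{\mathcal{F}_t} = D^\varepsilon_t = \exp\Big( \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\,dY^\varepsilon_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big).$$

So $D^\varepsilon$ satisfies the SDE

$$dD^\varepsilon_t = D^\varepsilon_t h(X^\varepsilon_t, Z^\varepsilon_t)^*\,dY^\varepsilon_t, \qquad D^\varepsilon_0 = 1.$$

Since under $P^\varepsilon$, $Y^\varepsilon$ is a Brownian motion, we get by Itô isometry

$$E_{P^\varepsilon}[(D^\varepsilon_t)^2] = 1 + E_{P^\varepsilon}\Big[ \int_0^t (D^\varepsilon_s)^2 |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big] \le 1 + \|h\|^2_\infty \int_0^t E_{P^\varepsilon}[(D^\varepsilon_s)^2]\,ds,$$

so that by Gronwall $E_{P^\varepsilon}[(D^\varepsilon_t)^2] \le e^{\|h\|^2_\infty t} < \infty$.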



□ 
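The second-moment bound for the density process can be checked by hand in the simplest case. The sketch below assumes a constant scalar $h$ (an assumption made only for this illustration), so that $D_t = \exp(hY_t - h^2 t/2)$ with $Y_t \sim N(0,t)$ under the reference measure, and $E[D_t^2] = e^{h^2 t}$, which is exactly the value $e^{\|h\|_\infty^2 t}$ that a Gronwall argument produces. Function names and quadrature parameters are hypothetical.

```python
import math


def second_moment(h: float, t: float, y_max: float = 12.0, steps: int = 6_000) -> float:
    """E[D_t^2] for D_t = exp(h*Y_t - h^2*t/2), Y_t ~ N(0, t), by trapezoidal quadrature."""
    dy = 2.0 * y_max / steps
    total = 0.0
    for i in range(steps + 1):
        y = -y_max + i * dy  # standardised variable, Y_t = sqrt(t) * y
        weight = 0.5 if i in (0, steps) else 1.0
        d_squared = math.exp(2.0 * h * math.sqrt(t) * y - h * h * t)
        density = math.exp(-0.5 * y * y) / math.sqrt(2.0 * math.pi)
        total += weight * d_squared * density
    return total * dy


def gronwall_bound(h_sup: float, t: float) -> float:
    """The value exp(||h||^2 * t) obtained from Gronwall's lemma."""
    return math.exp(h_sup * h_sup * t)


if __name__ == "__main__":
    print(second_moment(0.7, 2.0), gronwall_bound(0.7, 2.0))
```

For non-constant bounded $h$ the bound is in general not attained, but it remains uniform in $\varepsilon$, which is all the proof requires.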



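Lemma 6.6. Assume $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, that $\varphi \in C_b^7$, and that the initial distribution $Q_{(X^\varepsilon_0, Z^\varepsilon_0)}$ has finite moments of every order. Then for every $p \ge 1$ there exists $C > 0$, independent of $\varphi$, such that

$$E_Q[|\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)|^p] \le \varepsilon^{p/2} C \|\varphi\|^p_{4,\infty}.$$

Proof. As we already described in the introduction, we obtain from Lemma 6.5

$$\begin{aligned}
E_Q[|\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)|^p] &= E_Q\Big[ \Big| \int (v^\varepsilon_0(x,z) - v^0_0(x))\, Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx,dz) \Big|^p \Big] \\
&\le \int E_Q[|v^\varepsilon_0(x,z) - v^0_0(x)|^p]\, Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx,dz) \\
&\le \varepsilon^{p/2} C_1 \int (1 + |x|^{q_1} + |z|^{q_2})\, Q_{(X^\varepsilon_0, Z^\varepsilon_0)}(dx,dz)\, \|\varphi\|^p_{4,\infty}.
\end{aligned}$$

□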



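The convergence of the actual filter, i.e. of $\pi^{\varepsilon,x}$ to $\pi^0$, now follows exactly as in Chapter 9.4 of Bain and Crisan (2009). For the sake of completeness, we include the arguments.

Lemma 6.7. Let $p \ge 1$. Then

$$\sup_{\varepsilon \in (0,1],\, t \in [0,T]} \big\{ E_Q[|\rho^{\varepsilon,x}_t(1)|^{-p}] + E_Q[|\rho^0_t(1)|^{-p}] \big\} < \infty$$

as long as $\rho^{\varepsilon,x}$ and $\rho^0$ are well defined and $h$ is bounded.

Proof. We give the argument for $E_Q[|\rho^{\varepsilon,x}_t(1)|^{-p}]$, the case of $E_Q[|\rho^0_t(1)|^{-p}]$ being completely analogous. We have

$$E_Q[|\rho^{\varepsilon,x}_t(1)|^{-p}] \le E_{P^\varepsilon}\big[ |\rho^{\varepsilon,x}_t(1)|^{-2p} \big]^{1/2} E_{P^\varepsilon}\Big[ \Big( \frac{dQ}{dP^\varepsilon} \Big)^2 \Big]^{1/2}.$$

We showed in the proof of Lemma 6.5 that the second expectation is finite. Note that $x \mapsto x^{-2p}$ is convex on $(0,\infty)$. Therefore by Jensen's inequality,

$$\begin{aligned}
E_{P^\varepsilon}[|\rho^{\varepsilon,x}_t(1)|^{-2p}] &= E_{P^\varepsilon}\Big[ \Big( E_{P^\varepsilon}\Big[ \exp\Big( \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\,dY^\varepsilon_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big) \,\Big|\, \mathcal{Y}_t \Big] \Big)^{-2p} \Big] \\
&\le E_{P^\varepsilon}\Big[ E_{P^\varepsilon}\Big[ \exp\Big( \int_0^t h(X^\varepsilon_s, Z^\varepsilon_s)^*\,dY^\varepsilon_s - \frac{1}{2} \int_0^t |h(X^\varepsilon_s, Z^\varepsilon_s)|^2\,ds \Big)^{-2p} \,\Big|\, \mathcal{Y}_t \Big] \Big] \\
&= E_{P^\varepsilon}\Big[ \Big( \frac{dQ}{dP^\varepsilon} \Big)^{-2p} \Big] = E_Q\Big[ \Big( \frac{dP^\varepsilon}{dQ} \Big)^{2p+1} \Big].
\end{aligned}$$

The result now follows exactly as in the proof of Lemma 6.5, because for $\tilde D^\varepsilon_t = \frac{dP^\varepsilon}{dQ}\big|_{\mathcal{F}_t}$ we have

$$d\tilde D^\varepsilon_t = -\tilde D^\varepsilon_t h(X^\varepsilon_t, Z^\varepsilon_t)^*\,dB_t, \qquad \tilde D^\varepsilon_0 = 1,$$

and $B$ is a Brownian motion under $Q$. □

Define for any measurable and bounded test function $\varphi$:

$$\pi^0_t(\varphi) := \frac{\rho^0_t(\varphi)}{\rho^0_t(1)}.$$

Recall that $\pi^{\varepsilon,x}$ was defined analogously with $\rho^{\varepsilon,x}$ instead of $\rho^0$. We then have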

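Lemma 6.8. Assume $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, and that the initial distribution $Q_{(X^\varepsilon_0, Z^\varepsilon_0)}$ has finite moments of every order. Let $p \ge 1$. Then there exists $C > 0$ such that for every $\varphi \in C_b^7$,

$$E_Q[|\pi^{\varepsilon,x}_T(\varphi) - \pi^0_T(\varphi)|^p] \le \varepsilon^{p/2} C \|\varphi\|^p_{4,\infty}.$$

Proof. In the third line we use that $\pi^{\varepsilon,x}_T$ is a.s. equal to a probability measure.

$$\begin{aligned}
E_Q[|\pi^{\varepsilon,x}_T(\varphi) - \pi^0_T(\varphi)|^p]
&= E_Q\Big[ \Big| \frac{\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)}{\rho^0_T(1)} - \frac{\pi^{\varepsilon,x}_T(\varphi)}{\rho^0_T(1)} \big( \rho^{\varepsilon,x}_T(1) - \rho^0_T(1) \big) \Big|^p \Big] \\
&\le C_p \Big( E_Q\Big[ \Big| \frac{\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)}{\rho^0_T(1)} \Big|^p \Big] + E_Q\Big[ \Big| \frac{\pi^{\varepsilon,x}_T(\varphi)}{\rho^0_T(1)} \big( \rho^{\varepsilon,x}_T(1) - \rho^0_T(1) \big) \Big|^p \Big] \Big) \\
&\le C_p E_Q[|\rho^0_T(1)|^{-2p}]^{1/2} \Big( E_Q[|\rho^{\varepsilon,x}_T(\varphi) - \rho^0_T(\varphi)|^{2p}]^{1/2} + \|\varphi\|^p_\infty E_Q[|\rho^{\varepsilon,x}_T(1) - \rho^0_T(1)|^{2p}]^{1/2} \Big) \\
&\le \varepsilon^{p/2} C \|\varphi\|^p_{4,\infty},
\end{aligned}$$

where the last step follows from Lemma 6.6 and Lemma 6.7. □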

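Since the bound only depends on $\|\varphi\|_{4,\infty}$, we can replace the assumption $\varphi \in C_b^7$ by $\varphi \in C_b^4$: Just approximate $\varphi \in C_b^4$ by $\varphi^n \in C_b^7$ in the $\|\cdot\|_{4,\infty}$-norm, and take advantage of the fact that $\pi^{\varepsilon,x}_T$ and $\pi^0_T$ are a.s. equal to probability measures. Therefore we have

Corollary 6.9. Assume $(H_{stat})$, $(HF_{8,4})$, $(HS_{7,4})$, $(HO_{8,4})$, and that the initial distribution $Q_{(X^\varepsilon_0, Z^\varepsilon_0)}$ has finite moments of every order. Let $p \ge 1$. Then there exists $C > 0$ such that for every $\varphi \in C_b^4$,

$$E_Q[|\pi^{\varepsilon,x}_T(\varphi) - \pi^0_T(\varphi)|^p] \le \varepsilon^{p/2} C \|\varphi\|^p_{4,\infty}.$$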

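Now note that there exists a countable algebra $(\varphi_i)_{i \in \mathbb{N}}$ of functions that strongly separates points in $\mathbb{R}^m$. That is, for every $x \in \mathbb{R}^m$ and $\delta > 0$, there exists $i \in \mathbb{N}$ such that

$$\inf_{y : |x - y| > \delta} |\varphi_i(x) - \varphi_i(y)| > 0.$$

Take e.g. all functions of the form

$$\exp\Big( -\sum_{j=1}^n q_j |x - x_j|^2 \Big)$$

with $n \in \mathbb{N}$, $q_j \in \mathbb{Q}_+$, $x_j \in \mathbb{Q}^m$. By Theorem 3.4.5 of Ethier and Kurtz (1986), the sequence $(\varphi_i)$ is convergence determining for the topology of weak convergence of probability measures. That is, if $\mu_n$ and $\mu$ are probability measures on $\mathbb{R}^m$ such that $\lim_{n \to \infty} \mu_n(\varphi_i) = \mu(\varphi_i)$ for every $i \in \mathbb{N}$, then $\mu_n$ converges weakly to $\mu$.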

Define the following metric on the space of probability measures on M™: 



\v{ipi) - p{ipi)\ 
Because $(\varphi_i)$ is convergence determining, the metric $d$ generates the topology of weak convergence. Therefore the proof of Theorem 3.1 is complete. □
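To make the role of the test functions concrete, here is a small sketch of a distance of this type between two empirical measures on the real line. It is an illustration under assumptions, not the paper's construction: the finite family of Gaussian-type test functions, the weights $2^{-i}$, the truncation to finitely many terms, and all function names are choices made for this example.

```python
import math
from typing import Callable, Sequence

Phi = Callable[[float], float]


def gaussian_bump(q: float, center: float) -> Phi:
    """Test function x -> exp(-q * (x - center)^2)."""
    return lambda x: math.exp(-q * (x - center) ** 2)


def integrate(measure: Sequence[float], phi: Phi) -> float:
    """Integral of phi against the empirical measure of the given sample points."""
    return sum(phi(x) for x in measure) / len(measure)


def weighted_distance(nu: Sequence[float], mu: Sequence[float],
                      test_functions: Sequence[Phi]) -> float:
    """Sum over i >= 1 of |nu(phi_i) - mu(phi_i)| / 2^i, truncated to a finite family."""
    return sum(abs(integrate(nu, phi) - integrate(mu, phi)) / 2 ** i
               for i, phi in enumerate(test_functions, start=1))


# A small hypothetical family with rational widths and centers.
FAMILY = [gaussian_bump(q, c) for q in (0.5, 1.0, 2.0) for c in (-1.0, 0.0, 1.0)]

if __name__ == "__main__":
    print(weighted_distance([0.0, 0.1], [0.0, 0.2], FAMILY))
```

With a truncated family the function is only a pseudo-metric; the countable family in the text is what guarantees that distinct probability measures are separated.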

7. Conclusion and future directions 

This paper presented the theoretical basis for the development of a lower-dimensional particle filtering algorithm for state estimation in complex multiscale systems. To this end, we combined stochastic homogenization with nonlinear filtering theory to construct a homogenized SPDE which is the approximation of a lower-dimensional nonlinear filter for the "coarse-grained" process. The convergence of the optimal filter of the "coarse-grained" process to the solution of the homogenized filter is shown using BDSDEs and asymptotic techniques. This homogenized SPDE can be used as the basis for an efficient multiscale particle filtering algorithm for estimating the slow dynamics of the system, without directly accounting for the fast dynamics. Numerical results based on this scheme, which enables efficient incorporation of observation data for estimating the coarse-grained ("slow") dynamics, are presented for several higher-dimensional multiscale applications in Namachchivaya et al. (2012).

Even though this paper deals with just one widely separated characteristic time scale, one can extend this work to incorporate a more realistic setting where the signal has more than one time scale separation. As before we let $\varepsilon$ be a small parameter that measures the ratio of slow and fast time scales. Consider the signal and observation processes governed by:

$$\begin{aligned}
dZ^\varepsilon_t &= \frac{1}{\varepsilon^2} f(Z^\varepsilon_t, X^\varepsilon_t)\,dt + \frac{1}{\varepsilon} g(Z^\varepsilon_t, X^\varepsilon_t)\,dW_t, & Z^\varepsilon_0 &= z, \\
dX^\varepsilon_t &= \frac{1}{\varepsilon} b^I(Z^\varepsilon_t, X^\varepsilon_t)\,dt + b(Z^\varepsilon_t, X^\varepsilon_t)\,dt + \sigma(Z^\varepsilon_t, X^\varepsilon_t)\,dV_t, & X^\varepsilon_0 &= x, \qquad (26) \\
dY^\varepsilon_t &= h(Z^\varepsilon_t, X^\varepsilon_t)\,dt + dB_t, & Y^\varepsilon_0 &= 0,
\end{aligned}$$

where $W$, $V$ and $B$ are independent Wiener processes and $x$ and $z$ are random initial conditions which are independent of $W$, $V$ and $B$. It is important to realize that there are several scales in (26): even the slow process $X^\varepsilon$ has a fast varying component. This case is important, in particular, for applications in geophysical flows and climate dynamics. The drift term $b$ and the diffusion $\sigma$ cause fluctuations of order 1, and the drift term $f$ and the diffusion $g$ cause fluctuations of order $\varepsilon^{-2}$, whereas the drift term $b^I$ causes fluctuations at an intermediate order $\varepsilon^{-1}$. It was found that when the average of $b^I$ with respect to the invariant measure of the fast component $Z^\varepsilon$ (for the fixed slow component) is zero, the limit distribution of the slow component (away from the initial layer) can also be obtained in terms of the solution of some auxiliary Poisson equation in the homogenization theory. However, a unified framework to deal with the $b^I$ term in developing a lower-dimensional nonlinear filter for the "coarse-grained" process is still not available.